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ABSTRACT 


A general model for parallel computation is developed in three 
parts. One part, the data flow graph, describes how actors which 
transform and test values are connected to the locations in a finite 
memory. Another part, an interpretation, supplies information about 
the contents of memory and the detailed nature of the transformations 
and tests. 


The third part specifies how initiations and terminations of the 
actors are allowed to occur. We define this in a general way, using 
a set of sequences of initiation and termination events to model 
control. This allows us to prove results which apply to a broad 
class of control mechanisms. 


Our major results are analogous to a theorem of Karp and Miller. 
Thelr theorem defines a class of schemata for which conflict- 
freeness is necessary and sufficient for determinacy. We use a 
weaker notion of determinacy which depends only upon the final 
contents of a subset of the memory locations. To establish 
necessity, we introduce the property of productivity which 
expresses whether individual transformations and tests contribute 
to the final results of a computation. 
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oleh A Note on Mathematical Style 

It is conventional to use single letters to identify mathematical 
objects such as functions, sets and relations. The advantage is 
primarily one of conciseness. Unfortunately, this may come at the 
expense of understandability, and almost certainly with a loss of 
readability. The number of formal o:iccts which will be introduced in 
the course of this dissertation would quickly exhaust the alphabets 
commonly available on typewriters. Worse still, the mnemonic value 
would be vanishingly small. 

We will therefore abandon the single letter convention and freely 
use words Sane phrases to identify formal objects. As often as not, 
the name we choose for a set will simply be the plural form of the 
elements it comprises. Thus, we will not feel obliged to mention that 
SCHEMA INPUTS is a set whose elements are, in fact, schema inputs. As 
a means of avoiding unintended confusion, we will capitalize formal 
names. We will use these conventions throughout with the hope that 
they will enlighten rather than confuse. 

When mathematical precision conflicts with clarity or perspective 
in our definitions or proofs, we will favor the latter. For example, 
we will talk about “edges labeled with numbers" rather than formalize 
a "function from the set of edges into the natural numbers." In all 
cases, it should be clear how the proofs and definitions could be made 
more rigorous if so desired. 

A glossary can be found at the end of this dissertation for the 


reader's convenience. 
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La 2ecl Introduction and Background 

This dissertation is, in effect, a discussion of a model for parallel 
computation. Everything a mathematician needs to know about the model can 
be found in the subsequent chapters. However, the model is more than a 
collection of abstract definitions. Its form was influenced by a number 
of areas in computer science, and a knowledge of these may help the reader 
to push through some of the abstractions and to feel comfortable with the 


model. 


1.3.1 Schematology 

Schemata are models for computation, as are Turing machines, finite 
state machines, programs written in a programming language, and so forth. 
The feature which distinguishes schemata is the lack of interpretation for 
primitive functions and predicates. To logicians, this is a familiar 
concept. For the rest of us, a common analogue is fortunately available. 


Suppose an ordinary compiler runs across a statement such as 
ee ECV ws Bae) 


Assume function f is externally defined. Although the compiler cannot 
know exactly what f does, it is not entirely in the dark. The compiler 
at least knows where f will get its inputs and where it will store its 
result. This may seem to be very little knowledge indeed, but, with a 
few more assumptions, it allows the compiler to perform transformations 
such as calling f with copies of the inputs or using u as a temporary 
work area until f has returned its result. For example, it could 


substitute something like 


ii 


Yeo: 


u< fluyw,x,'*"4y,2) 


for the original statement, and not alter the meaning of the program. 
The point to be noticed here is that the transformation is valid 
no matter how f is defined (subject to certain assumptions we will 
presently discuss.) Schematology is concerned with making statements 
about computation which remain valid regardless of the definition of 
the functions and predicates used. From the above discussion, it should 
be clear that schematology has immediate application in compiler theory. 
Although the "external routine" analogue may be useful to those 
whose background tends more towards computers than towards formal logic, 
it must not be pushed too far. There are assumptions we make about 
functions and predicates that are not always true for arbitrary programs. 
Stated simply, we assume that all functions and predicates implement 
total mathematical functions. To emphasize what is not allowed as a 
function or predicate, some elaboration is in order. A function or 
predicate name may appear many times in a schema, but it must always take 
the same number of arguments. Thus, although one could argue for the 
desirability of a generalized addition function which returns the sum of 
an arbitrary number of arguments, such constructs will not be allowed in 
schemata. Functions and predicates are presumed to be defined for all 
inputs. Thus, no function could precisely model division which is not 
defined for divisor 0. Given the same set of inputs, functions and 
predicates must always return the same results. This precludes certain 
routines with "memory" such as "clock functions" or predicates which are 


true only the first time they are used. Functions and predicates do not 
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alter their inputs, and have no "side-effects" whatever. 

From this discussion, the reader can surmise that the schemata defined 
herein are not intended to model every computation that might be possible 
using conventional programming languages. We are, instead, focusing on a 


limited, but very rich subset of such computations. 


In addition to compiler theory, there are other justifications for 
adopting a schematized model. The Turing machine is one of the best 
known models for interpreted computation. Hopelessly inefficient as a 
practical computing device, Turing machines have been most valuable in 
identifying problems which cannot be systematically solved. For example, 
we know there is no procedure for determining if an arbitrary Turing 
machine halts when started on a particular input, or whether two Turing 
machines compute the same partial function. We cannot find the "fastest" 
Turing machine which implements an arbitrary function because, in some 
cases, there is always a faster machine, and one still faster and so on 
ad infinitum. In the face of such undecidability results, we must lower 
our sights somewhat. Schematalogy sacrifices a certain amount of 
"relevance" for the possibility of answering questions analogous to the 
above. 

It would seem to be a better approach to go only half-way towards 
schemata and allow both interpreted and uninterpreted operations. After 
all, this is precisely the problem a compiler must handle. Unfortunately, 
the amount of interpretation which can be tolerated without introducing 
undecidability problems is very small. For example two counters with 


unlimited capacity are sufficient to mimic Turing machines!9, 
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1.3.2 Schematology - History 

Ianov is generally credited with the first schematized model for 
computation!5 His model treats all of storage as a monolithic entity 
which is transformed by operations. The operation to be applied is 
determined by the outcome of a number of predicates testing the store. 
Ianov showed that equivalence of his schemata was decidable. Rutledge!5 
later pointed out a correspondence between Ianov's schemata and a class 
of finite state machines. 

Luckham, Park and Paterson? developed a more familiar schematized 
model based on flow charts. Operator instructions in the flow charts 
are of the form 


label: x, + F(x, oy ee ee Pa 


with branching in the charts accomodated by transfer instructions such as 


label: p(x, ),tlabel ,flabel 


By establishing a correspondence with a class of automata, they were 
able to show that the equivalence problem for schemata using two or more 
memory locations is not decidable. However, certain restricted classes 


of flow chart schemata were identified for which equivalence is decidable. 


Schemata have also been used by Hewitt and Paterson to allow 
meaningful comparison of various control mechanisms such as iteration, 


recursion, and recursion with limited parallelism!?. 
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1.4.1 Parallelism 

Parallel processing as it exists today generally refers to concurrent 
execution of several processes which are themselves strictly sequential. 
Although this is an important trend in the efficient use of computer 
systems, it is not the level of parallelism in which we are interested. 
We will be focusing on parallelism within a program at the instruction 
level. Therefore, we are discussing hardware which is not now common and 
probably will not be common for a generation or so. 

There is more than a theoretical difference between concurrent 
execution of one thousand programs and concurrent use of one thousand 
processors by a single program. An astronaut does not care if ground 
control can recompute a thousand courses in an hour. What he requires 
is one course recomputed in seconds. The number of such real time 
applications is constantly expanding. With speeds of individual 
processors approaching limits imposed by the speed of light and the 
laws of thermodynamics, iia: RV parallelism will take on increasing 
importance. 

In addition to such practical applications, one can make an argument 
for parallelism on theoretical grounds. A conventional algorithm may 
impose sequencing constraints which are arbitrary and have nothing to 
do with the function being implemented. A parallel specification can 
help to focus on this function by stripping away arbitrary sequencing 


and leaving only that which is essential. 
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1.4.2 Parallel Schematology - History 


8 have worked with schemata which 


Karp and Miller’, Slutzl®, and Keller 
model parallel computation. Using a hybrid operation which combines the 
réles of transformation and testing, they were able to identify certain 
classes of parallel schemata for which determinacy and equivalence are 
decidable. Some of the major results of this dissertation are extensions 
of the work of Karp and Miller to a model with more conventional 
operators and a weaker form of equivalence, 

Slutz's maxtmum parallel form!® which maximized the number of possible 
computations for a given interpretation helped to motivate our notion of 
productivity. With our weaker form of equivalence, the possibility of 
useless operations arises. A workable analogue to Slutz's maximum 
parallel form would have to restrict computations to some level of 


productivity. Lacking this, unproductive activity could be added 


indefinitely without truly increasing parallelism. 


Parallel schemata in which the arrival of data triggers activity have 
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been investigated by Rodriguez!3, Dennis? 


and Fosseen®. A desirable 


feature of these schemata is their inherent determinacy. The equivalence 
problem remains undecidable for general data flow schemata. However, 
both Rodriguez and Fosseen discuss decidable questions about equivalence 


of data links within a schema. 
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Asioee Modularity 

Modularity is a term finding great favor both in computer science and 
on Madison Avenue. Informally, we might say that a modular system is the 
interconnection of a small number of components whose inputs, outputs and 
input/output behavior are explicitly defined. Although we have restricted 
modular systems to comprise only a few components, there is no real 
restriction on system size. Each component may be a modular system whose 
components are themselves systems and so forth. We are therefore talking 
about a style of systems design rather than a class of systems. 

Modular systems have the virtue of being easy to debug and easy to 
modify. If such a system malfunctions, the components can be "unplugged" 
and tested to see that they meet their input/output specifications. 

Since this is, by definition, their only réle in the system, operational 
components need not be checked in greater detail. This allows quick 
isolation of problems. Furthermore, any component can be replaced by 
another with the same input/output behavior. This allows us to take 
advantage of developments which make modules cheaper, faster, more 
reliable, or whatever. The value of such capabilities in large systems 
can scarcely be overestimated. 

The definition of equivalent schemata reflects our input/output 
oriercation. We view schemata much as a programmer views subroutines. 
When given inputs, they either run on forever or they halt and produce 
some outputs. Equivalence will amount to halting on the same inputs, 
yielding the same results. Intermediate values, storage used, amount of 
parallelism exploited and the like will be of no consequence as far as 


equivalence is concerned. 
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1.6.1 Speed Independence 

Programmers on some early machines observed that they could initiate 
data input operations before they were finished with the data in the area 
to be overwritten. They knew there would be ample time to access the 
data before the relatively slow input devices could effect the transfer. 
This technique was actually an early form of parallel processing, but a 
decidedly dangerous one. For example, if the I/0 device had been 
improved or the program siuutaved rather than executed, chaos could 
have resulted. 

The peculiarities of particular machines and operating systems now 
make any assumptions about relative speeds highly suspect. Our model 
will not assume that all operations take the same amount of time or 
even that a particular operation always terminates after a fixed delay 
after it has initiated. In fact, our model will make no mention of 
delays and timing. Instead, each action will have distinct, explicit 
initiation and termination events. All timing considerations will be 
modeled by the sequencing of these events. The model will therfore 
possess enough flexiblity to describe systems where the amount of time 


needed to complete an operation may be highly variable. 
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2.1.4 Data Flow Graphs - Introduction 

If schemata are a model for computation as we suggested in the 
introduction, then we should begin with a broad-brush approximation to 
just what a computation is. Let us informally define a computation as 
a process which performs transformations and tests on a set of values. 
This is obviously not to be taken too seriously, but it will serve as a 
foundation upon which to build. 

For @xample, the definition suggests that a model must have agents 
capable of transforming values and agents capable of testing values. 

We refer to these agents as operators and deciders respectively, and 
call them aetors collectively. As we noted in the introduction, with 
our schematized approach we do not define exactly what these actors do 
or even the set of values they act upon. However, by associating 
function names with operators and predicate names with deciders, we can 
constrain distinct actors to do the same thing, whatever that may be, 
by giving them the same function name or predicate name. 

If an operator performs a transformation of m values, we will 
depict the operator as a circle with its associated function name inside 
and with input arcs labeled 1 through m. An operator having no inputs 
is allowable: it allows us to model constants. Deciders are similar 
but they will be diamond-shaped and we insist that they have at least 
one input: they must have something upon which to base a decision. Of 
course, all actors with which a given name is associated must have the 
same number of inputs. 


We assume that all operators have exactly one output. There is no 


real loss of generality, since we could always model a transformation 
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with k outputs by k single-output operators. Deciders, on the other 
hand, have no outputs. They have outcomes, either true or false. 
Again, binary deciders can be put together in such a way as to model 
k-way tests. Figure 2.1 shows an operator and a decider. 

In our model, the values input to actors and produced by operators 
are presumed to reside in memory locations. Which location supplies 
which input is depicted by attaching the arcs on actors to memory 
locations. The same location may supply more than one input, and may 
also be the output location of an operator to which it supplies an 
input. Some actor/memory interconnections are shown in Figure 2.2. 

Labeling the arcs on such interconnections can be tedious and 
unenlightening. We will adopt the convention that arcs appear in 
order of increasing index starting from the bottom of the actor and 
proceding clockwise around it unless explicitly labeled otherwise. 

Thus, all arc labeling in Figure 2.2 is unnecessary. 

Presumably, we institute the series of transformations and tests 
which we called a computation because we are interested in some results. 
In general, we expect these results to depend in some way on an initial 
set of values we provide to the computation. In our actor/memory 
interconnections, we identify two ordered subsets of the memory 
locations. One set, the schema inputs, are presumed to hold the 
initial values when the computys%ion begius. The other set, the schema 
outputs, will contain the results if and when the computation terminates. 
The actor/memory interconnection and schema inputs and outputs determine 


what we will call a data flow graph. 


hed 


Figure 2.1a An operator o with Figure 2.lb A decider d with 


m-ary function name f n-ary predicate name p 


Figure 2.2 An interconnection of actors and memory locations 
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The class of computations we will be investigating have finite data 
flow graphs which are not altered during the course of computation. We 
cannot allocate additional memory locations, change the function name 
associated with an actor, or move an actor around in the graph. We do 
not claim to have a satisfactory way of representing structured data 
such as arrays, lists or stacks. These features are not deemed 
unimportant, but rather lie outside of the scope of this dissertation. 

If the reader experiments with producing a data flow graph 
corresponding to a familiar computation (as we shall do in Section 1.3), 
he will probably discover that simple assignment occurs. That is, we 
often want to simply copy the contents of one location into another. 
This is one function which is always meaningful, independent of the 
domain of values wity which ue are dealing. We therefore allow 
tdentitty operators to be used in data flow graphs. We will reserve 


teott 


the function name for these operators. 


22 


2s Mee? Data Flow Graphs - Formal Definition 

A data flow graph is a directed, labeled, bipartite graph. One set 
of nodes is a finite number of locattons collectively referred to as 
MEMORY. An ordered (possibly empty) subset of MEMORY is distinguished 
as the SCHEMA INPUTS, and another ordered (nonempty) subset is 
distinguished as the SCHEMA OUTPUTS. The other set of nodes are a 
finite collection of ACTORS, partitioned into OPERATORS and DECIDERS. 
Associated with each operator is one of a set of FUNCTION NAMES. For 
simplicity, we will assume that the number of its inputs (20) is 
implicit in the function name. If f is an m-ary function name, then 
any operator o with which f is associated has input arcs labeled 1 
through m originating on (not necessarily distinct) locations in 
MEMORY. We call these locations the tmput locations of o. In 
particular, we call the location on the arc labeled i the gee input 
locatton of o. Each operator o has a single output arc terminating on 
some location in MEMORY (not necessarily disjoint from the input 
locations of o.) We refer to this as the output locatton of o. 
Associated with each decider is one of a set of PREDICATE NAMES. As 
with function names, we will assume that the number of inputs (>0) is 
implicit in the predicate name. If p is an n-ary predicate name, then 
any decider d with which p is associated has input arcs labeled 1 
through n originating on (not necessarily distinct) locations in 


MEMORY, the tnput locations of d. Deciders have no output ares. 
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21s Data Flow Graphs - Example 

It might be instructive to take a familiar model for computation, 
a program, and consider how a corresponding data flow graph could be 
defined. The program in Figure 2.3 mimics integer division. hat is, 
given a numerator N and a denominator D, it will determine a quotient Q 
and remainder Pf such that N=QxD+R, the magnitude of R is less than the 
Magnituce of D, and if Ris ne coro, it has the same sign as Q. The 
example is not profound, but tue reader should look it over since we 
will refer to it in several later sections. 

An obvious first step is to equip our data flow graph with a memory 
location for each variable in the program. Of course, we may need 
additional locations for temporary results and the like. The choice of 


actors is not guite so clear. For example, in line 6 we wish to 


increase Q by 1. This could be accomplished by adding the constant 1 
to Q via the nor-+.2 additic “metion, or by using a special 
add-l-to-the-arzument unary vunction. This obviously makes no 


difference in the program, but will lead to quite different data flow 
graphs. A similar choice occurs for the comparisons with constant 0 on 
lines 9 and 13. 

A complete data flow graph is shown in Figure 2.4. We could have 
used function names like < or - but these names are hard to dissociate 
from their conventional interpretations. Thinking of c as complement, 
£ as less than, m as magnitude, s as subtract, t as tally (add 1), and 
Z as zero, the data flow graph should be quite easy to analyze. The 
reader is cncouraged to verify line by line that the data flow of the 


program can be duplicated. The identity copies 9 for use by p and q. 
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26 fed Events - Introduction 

Although actors are the atomic elements by which computation can be 
carried out, we want a description of their behavior which is more 
detailed than "actor a happens." This level would suffice for sequential 
computations, in which there is only one site of activity. i a parallel 
computation, however, we need a mechanism capable of describing concurrent, 
asynchronous activity. 

To this end, we associate with each operator in a data flow graph, an 
initiation event and a termination event. With each decider, we associate 
an initiation event and two termination events. In essence, an initiation 
event and corresponding termination event bracket what we referred to 
earlier as "actor a happening." However, it is not necessary for 
initiations and terminations of an actor to occur in strict alternation. 
By postulating a means of keeping track of initiations for which the 
corresponding termination has not yet taken place, we can allow several 
initiations to occur before a termination occurs, 

When we formally describe events in the next section, we do so by 
assuming that each actor has an associated queue and processor which 
computes the function or predicate named. It should be emphasized that 
these processors and queues are merely vehicles for defining the 
intended behavior of actors. Any implementation which gives rise to 
the same behavior would be equally acceptable. 


we now turn to a formalization of the semantics of events. 
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2.2.2 Events - Formal Description 

For each operator o with associated n-ary function name f, we 
define two events, the tnittation of o, denoted o, and the termination 
of o, denoted o. When o initiates, a value is associated with each 
input are of o by (non-destructively) reading the corresponding input 
location of o. This n-tuple of values can be thought of as being 
enqueued on an f-processor associated with o. The termination of o 
is defined only if this fifo queue is non-empty. (It should be noted 
that a queue containing 0-tuples is most certainly not empty.) In 
this case, the first n-tuple in the queue is removed, the f-processor 
is applied to the n-tuple, and the result is written into the output 
location of o, destroying any previous contents. 

For each decider d with associated n-ary predicate name q, we 
define three events, the initiatton of d, denoted d, and the true and 
false termtnattons of d, denoted d, and d., respectively. The intended 
semantics are similar to those for operators. Upon termination, there 
is no output location to overwrite, of course, but the outcome of the 
predicate named by p on the n-tuple of values is reflected in the 
choice of termination events, 

These rules define, for each data flow graph, an alphabet = of 
events associated with the actors. Since there are only finitely 


many actors, Z is always finite. 
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2.2.3 Events - Example 

Suppose, referring back to the data flow graph of Figure 2.4, we 
consider the sequences of events wwifff and wwffff and their effect 
upon the contents of location Q. When we begin, the contents of Q are 
undefined. When w initiates, the contents of Q remain undefined, but 
now there is a 0-tuple enqueued on w's z-processor. When w terminates, 
a value which we can symbolically refer to as z() is written into Q. 

At this point, all queues are again empty. When f initiates, the 
l-tuple z() is enqueued on f's t-processor. Up until now, both of 
sequences we have been considering have caused the same behavior. 

If f now terminates immediately, t(z()) will be writts.. into Q, 
destroying its former contents, z(). After another initiation and 
termination of f, Q will contain t(t{2()}!} and 1.1 queues will be 
empty. 

On the other hand, if f reinitiates before terminating, z() is 
placed on the t-processor's queue after the other 1l-tuple (which also 
happens to be z().) The first cerminsvicen of f leads to t(z()) being 
written into Q. The second termination then overwrites this value 
with the very same thing, t(z()). Again, all queues have been emptied. 


For the data flow graph shown, t={w,w,a,a,b,b,c,c,e,e,f,f,g,g,h,h, 


dda sdp PsP sPpododp odpt. 
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2.3.1 Data Flow Graphs - Summary 

Data flow graphs provide a formalism for describing the structural 
aspects of computation. They are not the only such mechanism one can 
devise. Karp and Miller’, Slutz!®, and Keller® have used a model in which 
actors are of a single, hybrid type. These "operations" combine the 
features of our operators and deciders by having one or more output 
locations and one or more possible outcomes. A graphical representation of 
such an actor is shown in Figure 2.5. An equivalent data flow graph 
structure is shown in Figure 2.6. We mimic the multiple outputs in the 
standard way. There are numerous methods for associating K(a) outcomes 
with collections of binary deciders. As shown, we use K(a)-1 deciders, 
and let the outcome correspond to the least index of a true decider, or 
K(a) if all are false. Thus, data flow graphs can model the hybrid 
operations. 

A more dramatic difference appears in one variant of the data flow 
schemata of Dennis? and Fosseen? Here actors are interconnected by fifo 
queues instead of memory locations. Data flow graphs can also reproduce 
the behavior exhibited by data flow schemata. It can be shown that the 
so-called well-formed schemata can operate with queues of length 1. Such 
queues are easily modeled by memory locations. Furthermore, queues can be 
modeled by identity operators between memory locations, using the implied 
queuing of multiple operator initiations. 

Therefore, data flow graphs are a fairly general model for describing 
the structure of a computation and are not likely to generate much 
controversy. Indeed, this is one reason why the data flow structure has 


been separated from the rest of the schema specification. 
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3.1.1 Control - Introduction 

Data flow graphs tell us only part of what we need to model a 
computation. They specify the structure of memory, transformations, and 
tests, but do not specify the order in which actors can initiate and 
terminate. This is the purpose of some sort of control mechanism. What 
we shall be referring to as a schema is a data flow graph and a 
specification of control. We review some possible control mechanisms 


before indicating how control is specified for our schemata. 


3.2.1 Control - Sequential Models 

The most familiar control mechanisms, the ones used by virtually all 
conventional programming languages, are sequential. There is no concurrent 
activity, so actor initiations and terminations occur with no intervening 
events. After each operator termination, there is a unique actor 
initiation which follows. When a decider terminates, there are two actor 
initiations which might follow, the choice being determined by the decider 
outcome. The rules about which actor initiates next are implicit in the 
semantics of the programming language used. 

Flow charts are a general way of specifying the sequencing of actors 
in a sequential program. Figure 3.1 shows a flow chart for the program in 
Figure 2.3. One can envisage a single locus of control moving along the 
arcs of the chart, initiating and terminating actors as it passes. Ifa 
program contains go to statements, the topology of such a chart can be 
quite complicated, a fact used to argue for better behaved sequential 


control primitives such as do-loops and if-then-else conditional clauses. 
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Figure 3.1 A flow chart control specification 


for the data flow graph of Figure 2.4 
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3.3.1 Control - Programs with Concurrency 

If we wish to maintain a program-like formalism but introduce 
concurrent activity, there must be a means of adding loci of control. 

If a program can be broken into completely independent parts, then each 
part can be assigned a dedicated locus of control. This is essentially 
what most multiprocessing systems do, assuring independence by working 
on unrelated programs. Practical programs, however, seldom factor into 
such independent parts. (If they did, they would be written as separate 
programs.) In general, then, the loci of control will interact. 

Fork and join primitives can be added to programming languages to 
express this interaction. At a fork, a single locus of control splits 
into several loci. At a join, a number of loci of control come together 
and a single locus exits. There have been numerous proposals!>* for 
implementation of such primitives, and we show a graphical representation 
of a program using these primitives in Figure 3.2. 

In inspecting such a graph, it is important to distinguish between 
the control flow through a fork primitive, in which control flows along 
all output arcs, and control out of deciders, in which control flows 
along exactly one of the output arcs. 

The resemblence to a precedence graph is quite striking here, with 
"unordered" actors being capable of concurrent activation. Precedence 
graphs, embellished to allow conditionals and while-loops have alse 


been used as control mechanisms% 
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Figure 3.2 A fork-join 


control specification for 
the data flow graph of 


Figure 2.4 


T 
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3.4.1 Control - Petri Nets 

Breaking away from program-like formalisms can lead to control 
structures such as Petri nets, introduced by Holt® Petri nets are such a 
simple yet powerful formalism for modeling concurrent activity that it is 
worth a short digression to study them, 

Petri nets are directed, labeled, bipartite graphs whose nodes are 
either states or events. States are drawn as circles and events as bars. 
States have an associated non-negative integer which is depicted by 
drawing the appropriate number of tokens in the state. This numbering is 
called the marking of the net. Activity in a Petri net is governed by a 
simple firing rule. An event is said to be enabled if each state on an 
arc into the event contains at least one token. An enabled event can 
ftre by removing one token from each input state and adding one token to 
each output state. Tokens need not be "conserved" by the process of 
firing. In fact, the total number of tokens in the net will change after 
an event fires unless the number of input states of the event equals the 
number of output states of the event. We show some Petri nets in Figure 
3.3. 

The event in Figure 3.3a is not enabled since there is an input state 
with no tokens. The event in Figure 3.3b is enabled, and, if it should 
fire, the resulting net would be that of Figure 3.3c. 

By associating the events determined by a data flow graph with a 
subset of the events in a Petri net, we can use the net as a control 


mechanism. We show such a net in Figure 3.4. 
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Figure 3.3a An event which is not enabled 


Figure 3.3b An event which is enabled 


Figure 3.3c The marking which would result if 


the event of Figure 3.3b fired 


Figure 3.4 A Petri net control specification for the data 


flox graph of Fizure 2.4 
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One reason that this control mechanism may appear complicated is the 
introduction of explicit initiation and termination events. Except for 
this, there is a strong resemblance to the fork/join formulation of Figure 
3.2. 

Slutz's flow graph schemata have a control mechanism which closely 


parallels Petri nets!® 


3.5.1 Control - Data Flow Models 

It is self-evident that an actor cannot initiate before its input 
values have been generated. Turning this observation around, Rodriguez}? 
Dennis? and Fosseen” have proposed models in which the arrival of values 
controls the initiation of actors. We will briefly describe one form of 
data flow schemata studied by Dennis and Fosseen? We call these "Dennis- 
Fosseen schemata" rather than "data flow schemata" to avoid possible 
confusion with our own data flow graphs. 

A Dennis-Fosseen schema is a directed, bipartite graph in which the 
nodes are either actors or links. It is helpful to think of the arcs in 
a Dennis-Fosseen schema as being able to hold a single value. Operators 
in Dennis-Fosseen schemata are actors with one or more indut arcs, a 
Single output arc, and an associated function name. If a value is present 
on each input arc and no value is present on the output arc, an operator 
is enabled to fire. When it fires, an operator removes the values from 
its input arcs and places on its output are the value which results from 
applying the named function to the inputs. Deciders in a Dennis-Fosseen 
schema have one or more input arcs, a single output arc, and an 


associated predicate name. As with operators, these actors are enabled 
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to fire when there is a value on each input arc and no value on the output 
arc. When a decider fires, the values on its input arcs are removed, and 
the boolean value resulting from the named predicate on the input values is 
placed on the output arc. Dennis-Fosseen schemata are constructed in such 
a way that arcs carry either boolean values or data values, never both. 

Dennis~Fosseen schemata may contain boolean actors to perform logical 
operations on boolean values. Boolean actors are enabled to fire when a 
boolean value is present on each input are and no value is present on the 
output arc. When such an actor fires, the input values are removed and the 
result of the named boolean operation on the input values is placed on the 
output arc. 

There are three types of actors which have both boolean and data 
input; true gates, false gates, and merges. A true gate has one input 
data are, one input boolean arc, and one output data arc. A true gate is 
enabled to fire when a.value is present on each input arc and no value is 
present on the output arc. When it fires, the input values are removed. 
If the boolean value was true, the input data value is placed on the 
output arc. Otherwise, no value is placed on the output arc. False gates 
are analogous, "passing" the input data value if a false boolean value is 
present and "swallowing" the data value for true boolean inputs. 

Merge actors have two input data arcs, one boolean input arc, and one 
output data arc. A merge is enabled to fire when there is a boolean value 
present, a data value on the input data arc indexed by the boolean value, 
and no value on the output arc. (The presence or absence of a data value 
on the other input data arc is irrelevant.) When it fires, a merge 


removes the boolean value and the indexed data input value, placing the 
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data value on the output arc. 

Link nodes serve aS connection points and also eliminate the need for 
explicit identity operators. A link node is enabled when a value is 
present on its input are and all output arcs contain no values. When a 
link node fires, the input value is removed and a copy of the value (data 
or boolean) is placed on each output arc of the link node. 

Figure 3.5 summarizes the elements of Dennis-Fosseen schemata. Each 
element is shown in enabled status and then, immediately to the right, the 
element is shown as if it had fired. Arcs which carry boolean values have 
solid arrowheads while data arcs have open arrowheads. 

Figure 3.6 shows how the elements of a Dennis-Fosseen schema can be 
assembled to model the program of Figure 2.3. This kind of schema is 
probably foreign to most readers, but with a little experimentation, one 
can quickly become familiar with its behavior. In particular, gates and 
merges come in groups fed by the same boolean. When initialized merges 
are encountered before the gates, a loop structure can be realized. This 
is the case with the three merges and five gates fed by the decider in the 
middle of Figure 3.6. When the gates are encountered before the merge, a? 
is true with the gates and merges fed by the deciders at the top of Figure 
3.6, conditionals are realized. We recommend that the reader check the 
behavior of the schema by first assuming that each decider is false, then 
assuming each is true. Note, in particular, that the "loop merges" are 


reinitialized to true when the loop is exited. 
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3.6.1 Control - Automata 

Consider a finite state acceptor whose state transitions are labeled 
with the events associated with a data flow graph. We can view the events 
on arcs out of a given state as those events which the acceptor allows in 
that state. Should one of the events occur, the acceptor enters the state 
to which the corresponding arc leads. wWe might call those sequences of 
events which lead from the initial state to some accepting state the 
sequences which are acceptable. 

Keller®, Karp and Miller’, and Slutz!® all consider state~machine 
control mechanisms, treating countably infinite state machines as well as 
finite state acceptors. Many of the other control mechanisms we have 
discussed have finite state counterparts which allow exactly the same 
sequences of events to occur. If, for a given n-state Petri net, there 
is an integer m such that no state in the net ever contains more than m 
tokens at a time, then there is a corresponding finite state control 
mechanism with no more than (m+1)" states. (A state in the finite state 
acceptor identifies how many tokens are present in each of the n states of 
the Petri net.) 

As the last example suggests, finite state acceptors may be consider- 
ably more complicated than the other models. In Figure 3.7 we show the 
beginning of a state table for a finite state control mechanism for the 
data flow graph of Figure 2.4. 

One can imagine how pushdown automata or even Turing machines could 


be employed to control the events in a data flow graph. 
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Figure 3.7 Partial state table of a finite state control specification 


for the data flow graph of Figure 2.4 
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3.7.1 Control Sets - Overview 

The last few sections serve to indicate the diversity of control 
mechanisms which have been or could be proposed to direct activity in 
data flow graphs. Each has advantages, real or imagined, which make it 
difficult to agree on a "best" mechanism. 

It is unfortunate that many of the results in schematology depend 
upon the particular choice of control mechanism, in ways that become 
clear only when one attempts to carry the result over to a different 
model. This works a particular hardship on newcomers to schematology 
since there is no general theory unifying the various models. 

In an effort to begin a more general approach to schematology, we 
have adopted a specification of control which is not tied to any 
particular control mechanism. We do this by observing that ali the 
mechanisms just discussed define sets of allowed sequences of events. 
We call these control sets, and we will generally be concerned with 
their properties as sets, not with the particular mechanism that 
determines them. 

This approach has both good and bad aspects. A control mechanism 
defines a control set inherently possessing properties which we must 
explicitly define and justify. Thus, our approach may appear slightly 
"verbose" and can be difficult to motivate. The advantage we gain in 
return is to make explicit those properties of control upon which 
schematalogical results depend. These results then carry over to all 
mechanisms which can be shown to impose the requisite properties. This 


may be au easier task than trying to establish the results directly. 
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3.7.2 Control Sets - Conventions and Formalisms 

Let £ be a finite alphabet. As is conventional, we will use =* to 
denote the set of all finite strings over Z. We let 4 represent the 
unique string of length 0, the empty string. We use EZ to denote the 
set of countably infinite strings over 2, and we define E to be the 
union of E* and I. We will use the terms 'string' and 'sequence' 
interchangeably. 

We use italicized letters late in the alphabet and possibly 
subscripted, v, w, Zos Xy5 and so forth, to denote sequences in im 
If we have sequences xel* and yel, then xy represents the sequence 
in z formed by concatenating x and y. Given zex and reZ*, x isa 
prefix of z if and only if there exists a sequence yek such that 
xcy=2. 

Suppose £ is the alphabet of events associated with a data flow 
graph. A string yck is said to be well-sequenced if, for every prefix 
x of y and for every actor a in the data flow graph, the number of 
occurrences of initiations of a in x is no less than the number of 
occurrences of terminations of a in x. In simple English, nothing 
is terminated which has not been initiated. Note that this property 
does not depend upon the connections in a data flow graph, only on 
the set of actors. 

A sequence yer is well-defined (for the data flow graph which 
determines £) if it is well~sequenced and for each prefix wa of y, 


and for each input location % of actor a, 2 is a schema input, or 


& is the output location of some operator o such that x = 210%,» 
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That is, in a well-defined sequence, no actor initiates until the 
contents of all of its input locations are meaningfully defined. A 
set of sequences is said to be well-defined if every sequence in the 
set is well-defined. 

We define a control set (of a data flow graph with alphabet of 
events £), CONTROL, to be any well-defined subset of 5 with the 
following property. For each finite sequence xeCONTROL, each schema 
output m is either the output location of some operator terminating 
in x, or mis a schema input (or both). Thus each control sequence, 
as we call the elements of CONTROL, which terminates leaves values in 
all schema output locations. 

Finally, and somewhat anticlimactically, we define a schema to 


be a data flow graph and some corresponding control set. 


3.7.3 Control Sets - Discussion 

The restriction of control sets to well-defined sequences is easy 
enough to justify. A string which is not well-sequenced, such as aa, 
simply has no meaning in terms of the behavior of data flow graphs. 
Similarly, we cannot make any sense out of actors initiating before 
the contents of their input locations have been established. For 
example, ee is well-sequenced for the data flow graph of Figure 2.4, 
but it is not meaningful because locations R and T have unspecified 


contents. Finally, if a sequence terminates, we expect it to leave a 


result in each schema output location. Although we do not exclude 
anythi:. very meaningful by assuming well-definition, there are noteworthy 


implications. For example, the finite strings the state-machine 
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of Figure 3.8 accepts are simply {a,a}*, but the set of all control 
sequences contained in {a,a}* cannot be recognized by any finite state 
acceptor. (Informally, wa must be accepted for all integers k20. 
For k greater than the number of states in a finite state machine, 
some state must be visited more than once while aX is being read. 

Then this cycle of length j>0 could be repeated so that akakt3 is 
also accepted. But this sequence is not well~sequenced, so it is 
not among the control sequences.) Thus, although it is semantically 


painless to exclude ill-defined sequences, it may be difficult or 


impossible to do so while maintaining a given control mechanism. 


Figure 3.8 A finite state control specification for which CONTROL is 


not a regular set 
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4.1.1 Interpretations - Introduction 

Schemata, via their data flow graphs and control sets, tell us 
everything of a structural and operational nature that we need to know 
about modeling computations. To put the schema model on a par with 
familiar models, however, we must supply the missing details about the 
function names and predicate names. 

An tnterpretation for a schema specifies the set of elements which 
the memory locations can contain. For "familiar interpretations", 
these will include integers, character strings, representations of 
real numbers, and so forth. Of course, there are also interpretations 
dealing with trees, or only with integers, or only with the number 6: 
the choice of domain is virtually unrestricted. 

An interpretation also specifies functions and predicates over 
these elements for the function names and predicate names of the data 
flow graph. These functions and predicates are total, so "familiar 
interpretations" would have to be extended to define such things as 
1/0 and "cat"<3.14159. Finally, the interpretation specifies the 


elements initially contained in the schema input locations. 


4.1.2 Interpretations - Formal Definition 
An interpretation I of a schema consists of 
1) A (mon-empty) set, DOMAIN, of elements. 
2) For each m-ary function name f, a total function es DOMAIN™>DOMAIN 
3) For each m-ary predicate name p, a total predicate p? :DOMAIN™+{T ,F} 


4) For each schema input location 2, an intttial element deDOMAIN 
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4.1.3 Interpretations - Summary 

Although we have defined interpretations of schemata, it is 
evident that the definition depends only upon the data flow graph 
component. If we have two or more data flow graphs with the same 
number of schema inputs, we can meaningfully define a single 
interpretation for all of them. There will be a single DOMAIN, 
and the set of initial elements will apply to each set of schema 
inputs. Functions and predicates must be assigned for all function 
names and predicate names, but since the arity is implicit in the 
names, this can be done consistently. That is, f cannot be a unary 
name in one graph and a binary name in another. Thus, a function 
associated with name f works in all data flow graphs. We will later 
speak of interpretations for two or more schemata, understanding 


that this is what is meant. 


41.4 Interpretations - Example 
Suppose we define an interpretation for the data flow graph of 
Figure 2.4. We can recapture the original intent of the program 


from which it was derived by the following interpretation. 


i 


1) DOMAIN = Z (The integers) 
2) z():=0 m(x) :=|x| t(x):=x+1 c(x) :=-x s(x,y):=x-y 
3) 2&¢x,y) :=x<y 


4) Initial elements = <l,-1> 


Example 4.1 An interpretation for the data flow graph of Figure 2.4 
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Cyan eee Data Dependence Graphs - Introduction 

We have defined schemata as the combination of data flow graphs, a 
formalism for describing the interconnectior of memory locations and 
actors, and control sets, a formalism for defining how events in a data 
flow graph are sequenced. We now define a mechanism, the data dependence 
graph, or dadep graph, which brings these two formalisms together. 

Before proceeding, it should be noted that dadep graphs are derived 
algorithmically from a data flow graph and associated well-defined 
sequence. Thus, there is really nothing in a dadep graph we didn't 
already have, given the data flow graph and sequence. Our justification 
for jai voducine this new formalism is one of clarification and 
convenience. 

The virtue of dadep graphs is that they make it easy to overlook 
some less interesting aspects of schemata such as the choice of names 
for memory locations and actors, and focus on relevant issues such as 
transforming and testing values via functions and predicates. As we 
shall see, it is possible to do this in such a way that we can derive 
important information about interpretations from the structure of the 
dadep graph. 

Dadep graphs are labeled, bipartite graphs whose nodes we call values 
and actions. Starting only with the initial values input to the schema, 
actions are added as actors initiate. One set of labels indicates the 
function name or predicate name associated with an action. Another set 
of labels indicates which values are currently contained in the memory 
locations. This labeling aids in determining which values are inputs to 


an action, shown by directed, indexed arcs from values to actions. 


(ex 
NO 


New values are addec when operators terminate. When deciders terminate, 
another set of labels indicate the outcome for the associated decision. 
Identity operators do not create values, but rather change the labeling 
with memory lcc:tion names to nake existing values available under new 
location mamas. -Ster formalizing these notions, an example of a dadep 


graph will be vevelonecd in some detail. 
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5.1.2 Dadep Graphs - Formal Definition 

The dadep graph determined by a data flow graph and corresponding 
well-defined sequence is a directed, labeled, bipartite graph. One set 
of nodes is referred to as VALUES. The other set is referred to as 
ACTIONS. ACTIONS are partitioned into OPERATIONS and DECISIONS. If 
the data flow graph has m schema inputs, then there exist m values 
labeled 1 through m called INITIAL VALUES. Each operation is labeled 
with a function name and each decision with a predicate name. If an 
action has an n-ary name, there are arcs labeled 1 through n incident 
upon the action from n (not necessarily distinct? values. There may be 
a single are from an operation to a value. Decisions are either 
unlabeled or are labeled with a T or an F. A subset of VALUES are 
labeled with one or more locations in MEMORY. 

Suppose we fix a data flow graph. We define inductively the dadep 
graph determined by a corresponding well-defined sequence. For 
notational convenience, we will use subscripting to identify the 
sequence determining a dadep graph. For example, VALUES will denote 
the set of values in the dadep graph determined by sequence x. 

For precision in the induction, we state the following induction 
hypothesis: 

Hypothesis 1: Each memory location which is either a schema 
input or the output location of some operator terminating 
in x is the label on exactly one value in VALUES, « 

Looking ahead to some further definitions, we also add induction 
hypotheses: 


Hypothesis 2: The dadep graph of sequence x is acyclic. 
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Hypothesis 3: Each value in VALUES, . is either one of the 
INITIAL VALUES, or it has a single input arc originating 
from an operation in OPERATIONS. (but not both.) 

For the basis of our induction, we consider the dadep graph of the 
empty sequence, A. If the data flow graph has m schema inputs, then 
VALUES, = {Vj oVooeesov Fe If the i schema input is location 2, then 
value Vv; is labeled 2. Furthermore, Vv; is distinguished as the a 
initial value. There are no actions and no arcs. Tt is easy to see 
that hypotheses 1, 2 and 3 are trivially satisfied. 

Suppose that all the components of the well-~iefined sequence x have 
been defined, and xo, ocZ, is also well-defined. We define the dadep 
graph of xo by cases as follows: 

INITIAL VALUES = INITIAL VALUES, in any case. Ifo is not the 
termination of some decider d, let j be the number of occurrences of o 
in xo. Otherwise, let j be the number of terminations of d in xo. 

Case 1: o is the initiation of an actor c other than an identity 
operator. Suppose g is the function name or predicate name associated 
with c. VALUES | = ES and the labeling with memory locations is 
unaltered. We form Bron by adding a new action, a, to Bele 
If c is an operator, a is added to OPERATIONS 2 Otherwise, it is 
added to Ete TONS ee Action a is labeled g. Since yo is well- 
defined, each input location of c is either a schema input, or is the 
output location of an operator terminating in x. By hypothesis 1, 
each of these locations is the label of exactly one value in VALUES 
If location g is the rie input location of c, add an are labeled i 


from the value in VALUES,, (=VALUES,) with label g to the new action a. 
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(If c is a O-ary operator, no arcs are added.) All other arcs remain 
as in the dadep graph of w. We refer to a as "the action corresponding 
to the as occurrence of c. 

Hypothesis 1 held for the dadep graph of x, no terminations were 
added, and the labeling with memory locations was unchanged. Hence 
hypothesis 1 still holds for the dadep graph of xo. Hypothesis 2 
still holds since all arcs added were to a node not in the dadep graph 
of £, so no cycies could be introduced. Hypothesis 3 follows from the 
fact that no new values were introduced. 

Case 2: o is the initiation of an actor c which is an identity 
operator. Suppose % is the input location of operator c. As in case 
1, there is a value in VALUES labeled with &. We will refer to this 
value as ''the value corresponding to the an occurrence of c." The 
dadep graph of xo is exactly the same as the dadep graph of x, so all 
hypotheses hold. 

Case 3: o is the termination of operator c which is not an identity 
operator. ACTIONS = ACTIONS... We form VALUES by adding a new value 
v to VALUES... Since xo is well-defined, c has initiated at least j 
times in x, so there will be an operation a in ACTIONS, corresponding to 
the ae occurrence of c. Add an arc from a to v, leaving all other arcs 
unchanged. Suppose the output location of c is 2. If a value in 
VALUES, has label 2, remove the label in the labeling of VALUES,» and, 
in any cass, Label v with 2. 

The (re)labeling procedure guaraiutees the validity of hypothesis 1. 
Hypothesis 2 follows because the cily anew arc is to a node not in the 


dadep graph of x. It will be seen that no other cases add values or 


arcs into existing values. Thus hypothesis 3 is satisfied. 
Case 4: ¢ is the termination of a decider c. Since xo is well- 
defined, c has initiated at least j] times in x. Hence, there is a 
fee ‘ . 4 ; > th 
decision a in ACTIONS, corresponding to the j occurrence of c. In 
the dadep graph of xo, a is given label T if O=Cp 5 OP label F if o=c 
No other changes are made, so all induction hypotheses remain valid. 


Case 5: o is the termination of identity operator c. Since xo is 


well-defined, there are at least j initiations of c in x. Hence there 


is a value v in VALUES, , corresponding to the ace occurrence of c. If 
the output location of c is & and & is the label on some value in 
VALUES.» remove the label in the labeling of VALUES In any case, 
label v with & in VALUES, No other changes are made. 


The (re)labeling preserves hypothesis 1, and since no arcs or nodes 


are added, hypotheses 2 and 3 also hold. 
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5.1.3 Dadep Graphs - Example 

The formal definition of dadep graphs will be useful in constructing 
some believable proofs, but it makes dadep graphs appear more formidable 
than the really are. Stepping through a specific construction should 
help to eliminate many potential sources of confusion. In Figure 5.1 
we do this for the data flow graph of Figure 2.4 and the well-defined 


sequence x = wweababdd .cffeedd,pp,qq,hh. 


Values are drawn as boxes, actions as the same shape as the 
corresponding actor in a data flow graph. We identify the initial 
values by the index depicted inside. The memory labels appear 
alongside values, function and predicate names appear inside the 
corresponding actions, and decision outcomes appear near the 
predicate names. We use the same arc labeling conventions 
introduced for data flow graphs. 

As an inspection of the dadep graph of x reveals, dadep graphs 
display the structural relationships between the values input and 
those generated by a sequence. This structure can then be used to 


2 


study what happens under a particular interpretation. 


N 


Figure 5.la Dadep sraph of A 


Kf] 


Figure 5.lb Dadep graph of w 


«O] 


Figure S.le Dadep graph of ww 
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Figure 5.1d  Dadep graph of wwe 


Figure 5.le Dadep graph of wwca 


Figure 5.1£ Dadep graph of wweab 


Figure 5.1lg Dadep graph of wweaba 


4D 


Figure 5.1nh Dadep graph of wweabab 
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A> 


Ligure $.1%  Dacep graph of wweababdd,, 
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| 4, 


Figure 5.1% Dadep graph of wweababdd_cff 
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c 


Figure 5.2 Dadep graph of x 
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Gadi alk Dadep Graphs - Definition of Depth 

In the sequel, we will find it convsanient to use certain structural 
properties of dadep graphs without havins to consider the particular 
sequence and data flow graph which determines them. One such feature is 
the depth of values and actions in a dadep graph. The depth will 
simply be the length (in terms of number of arcs) of the longest 
directed path leading to the value or action. 

Being a little more precise, we define the depth of all initial 
values to be C. We also define the depth of all O-ary operations in a 
dadep graph to be 0. The depth of any value other than an initial value 
is defined to be 1 plus the depth of the operation of which it is the 
output value. (Such an operation exists by hypothesis 3 of dadep graphs. ) 
The depth of an operation which is not O-ary is defined to be 1 plus the 
maximum depth of its input values. The acyclic nature of dadep graphs 
(hypothesis 2) guarantees the validity of this definition. Figure 6.1 
shows a dadep graph with the dcpth of each node indicated. 

Note that if there is a value [action] at depth k>1, there must be a 
value [operation] at depth k-2 on a path to it. Thus, there can be no 


"gaps in the depths of values [actions] greater than 2. 
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Figure 6.1 Depths of the nodes in the dadep graph of Figure 5.2 
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Interpreting Ladep Graphs 

Any interpretation of a schema will supply all detail we need to 
associate specific domain elements with the values in a “adep graph. The 
réle of decisions in the generation of values is indirect, and its 
explanation presupposes a thorough understanding of the nature of values 
and operations. We will therefore ignore, for now, the component of 
interpretations which specifies predicates, and focus on the domain, 
functions and initial elements. 

These three components define an assignment of domain elements to 
values in the obvious way. Initial elements are paired with initial 
values and the functions determine all other assignments. 


More formally, given a dadep graph D and an associated interpretation 


I, define a mapping el. VALUES +* DOMAIN by induction on depth as follows. 


Basis 1): If the depth of value v is 0, then v is an initial value, 
eth : I dt estcrs : 
say the i. Lefine § (v) to be the initial element of I corresponding 


-th . - 
to the i input location. 


Basis 2): If the depth of v is 1, then v must be the output value of 
an operation with a O-ary function name, say f. Define eT (y) to be the 


element Q) determined by I. 


7 


Induction step): Suppose et is defined at all values whose depth is 
k (>1) or less. If there is no value at depth k+l or k+2, then e: is 
defined for all values. (Recall that no gap greater than 2 can exist 
between depths of values.) Otherwise, consider a typical operation o 


with m-ary function name f, of which one of these values i3 an cutput 
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value. The depth of o is either k or k+l so the depths of o's input 

values, Vi> Yoo rae Vv, are k or less. Hence, ee is defined at each 
. i I, 1 Agee 

v, and we define E-(v) to be fF CE (v,)> ze (v))- 


(The next induction step is at level k+2.) 


In Figure 6.2 we show how such an assignment can be made by 
combining the dadep graph of Figure 5.2 with the interpretation of 
Example 4.1. The element associated with each value is shown inside 
the box representing the value. 

Such an interpreted graph shows a "history" of the elements 
generated if the given sequence can be observed under the given 
interpretation. (This "can be observed" issue is where the predicates 
will come in.) The labeling with memory locations then identifies 
the final contents of the schema output locations. Thus, such an 
interpreted dadep graph constitutes a completely satisfactory 
description of the input/output behavior of schemata for what we will 


define as I-conputattons. 


Figure 6.2 Assignment of elements to the values of the dadep graph 


of Figure 5.2 using the interpretation of Example 4.1 
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6.3.1 Free Interpretations - Introduction 

The starred values in Figure 6.2 all were assigned the same domain 
element. We would like to know if this was purely coincidental, or if 
it will be true for all interpretations. This leads us to introduce 
free or Herbrand or one-one interpretations, and the notion of 
simtlartty in dadep graphs. 

A brief inspection of the interpreted dadep graph of Figure 6.2 
reveals that if a different set of initial elements had been chosen, 
say 5 and -3, then the starred values would have been distinct. It 
would be nice if we could find an interpretation which assigns the 
same domain element to distinct values only if all interpretations 
assign the same element to them. Such interpretations in fact exist, 
and are variously called free, Herbrand, or one-one interpretations. 
These interpretations are a formalization of the "symbolic 


representation" we referred to in the definition of events. 


6.3.2 Free Interpretations - Formal Definition 

To define a free interpretation of a schema, let DOMAIN be the set 
of strings over the "alphabet" composed of FUNCTION NAMES, the integers 
from 1 through the number of schema inputs, and, for clarity, commas and 
parentheses. The initial element corresponding to the first schema 
input location is 'l', that corresponding to the second schema input is 
'2', and so forth. 


If f is an m-ary function name, then - applied to domain elements 


(which are strings) 81> So» » 5, is defined to be the string formed 


by concatenating function name f, a left parenthesis, S;> a comma, So, 
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a comma, and so on through So which is followed by a right parenthesis. 
For example, ££'1','g(2)','hC)"] = '#(1,¢(2),h0)!. 

Any set of total predicates over the given DOMAIN can be used to 
complete the interpretation. Thus, a data flow graph really has a family 
of free interpretations, differing only on the predicates assigned to the 


predicate names. 


6.3.3 Free Interpretations - Universality 


Suppose G, and G, are data flow graphs with the same number of schema 


1 


inputs. Let Dy and D,, be dadep graphs of well-defined sequences for the 


respective data flow graphs. Then values vy of DS and V5 of D, are 


assigned the sans clement by every interpretation if and only if they are 
assigned the same element of a free interpretation. 
The 'fonly if' half of the above result is trivial. Therefore, we need 


only demonstrate that if vy and Vy are assigned the same element by a free 


interpretation, then every interpretation assigns them the same element. 
We assume the contrary and derive a contradiction. 


Suppose that it is possible for values vy and Vo to be assigned 


different elements by some interpretation I even though they are 
assigned the same element by a free interpretation. We can assume 


without loss of generality that v, is a value of least depth for which 


1 
such a condition can arise. 


If the depth of va is 0, it must be an initial value, say the ek 


Since v, is assigned the same element 'i' by a free interpretation, it 


a 


: a od 9 ee : eee 
must be tne 1 initial value in Dos But then the same initial element 


of every interpretation is assigned to both Vy and Vos So Vv, must be at 


TL 


depth 1 or greater. 


If vy is at depth 1, then it is the output value of an operation with 


some O-ary function name f. The element of a free interpretation assigned 


tov, is therefore 'f()'. If ‘ is assigned this same element by a free 


interpretation, it too must be the output value of an operation with O-ary 


function name f. Since 26) would then be assigned to both vo and Vo for 


every interpretation I, it follows that v, must be at depth 2 or greater. 


1 


I+ must therefore be that vy is the output value of an operation Oo, 


with m-ary function name g, and m2l. Therefore vy and Vy are assigned 


element 'g(s) 28 ee by a free interpretation, where s, is the 


2? 
: . : th. 
element assigned by the free interpretation to the 1 input value, Viie 


of O,- Thus Vo must also be the output value of an operation o, with 


function name g and input values Vox through Vom which are assigned 


elements Sy through S. of the free interpretation. But each ae has path 


depth at least 2 less than vy: By hypothesis, each wage being assigned 


the same element s. of a free interpretation as is assigned to Mose must 


be assigned the same element as v,. for every interpretation. But then 


23 


vy and Vo could be assigned differing elements only if a" gave different 


results for the same m-tuple of elements. This violates the functionality 
I 
of g. 


We conclude that an interpretation which assigns different elements 


to Vy and Vo cannot exist. U 
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6.4.1 Similarity - Formal Definition 


Suppose we have two data flow graphs, Gy and G,, and dadep graphs D, 


2? 
and D, of corresponding well-defined sequences over the respective data 


flow graphs. We define similarity, a relation on actions and on values, 
by the following induction. 


Values vy in Dy and Vo in Dy are simtlar if each is the oe initial 


value, or if they are output values of similar operations. 


Actions ay in D, and a, in D, are stmtlar if they are labeled with 


the same function name or predicate name, and, if the name is m-ary with 


th 


m21, then for lsi<m, the es input value of a, is stmtlar to the i 


1 

input vaiue of ays 
Again, the acyclic nature of dadep graphs guarantees the validity of 

this inductive definition. That is, working back along directed arcs 


always leads to an initial value or a 0O-ary operation. Values or actions 


which are not similar are said to be dissimilar. 


6.4.2 Similarity - Relation to Free Interpretations 
We now show that values are similar if and only if they are assigned 
the same element by free interpretations. 


If values vy and V, are similar and the lesser depth is 0, then one 


of the values, say Vi? is the it® initial value. By hypothesis 3 of 


th 


dadep graphs and the definition of similarity, v,, must also be the i 


2 
initial value of its dadep graph. Both values are thus assigned 'i' by 
a free interpretation. 


Suppose similar values are assigned the same element by a free 


interpretation if the lesser depth of the values is k or less. Suppose 
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v, and v, are similar values such that the depth of v, in its dadep 


iL 2 BE 


graph is k+l or k+2. Since vy and Vy are similar and neither is an 


initial value, v5 and Vv, are output values of similar operations O71 


and o,. Since o, and o, are similar, they bear the same m-ary 


2 1 2 
function name f. Ifmis 0, v_. and Vv, are both assigned 'f()' by a 


. 3 ‘ bap iS loot . th 
free interpretation. If m is positive, then, by definition, the i 


1 


input value of O71 is similar to the fas input value of oO» for lsis<m. 


Since the depth of each input value of o, is k or less, it follows 


by our induction hypothesis that the rear input value of O17 is assigned 


1 


7 7 s -th . 
the same element, 83> of a free interpretation as is the i-° input 
value of On+ But then vy and V5 are both assigned "£6 1585> 6)! 
by any free interpretation. 
The proof that values assigned the same element by a free 
interpretation are similar can be carried out by induction on lesser 


depth in a manner directly analogous to the proof above. We therefore 


omit the details of the proof. 


Combining the last two proofs allows us to draw the following 


important conclusion. 


Theorem 6.1): Values vy and Vo 


assigned the same element by all interpretations if and only if the 


in dadep graphs dD and Ds are 


values are similar. 


74 


64044 Dadep Graphs - The Réle of Decisions 

Having investigated in some detail the nature of values in dadep 
graphs and the relation they bear to interpretations, we are now in a 
position to study decisions. 

As we saw, decisions are irrelevant with respect to the assignment 
of elements to values by any interpretation. However, the decisions in 
a dadep graph indicate whether the graph (and underlying sequence) are 
in some sense legitimate. 

For example, if & were interpreted as "larger than or equal to" 
instead of as "less than" in Figure 5.2, the predicates determined by 
the interpretation would have specified the opposite outcomes from 
those in the dadep graph. Thus, the dadep graph is not consistent 
with such an interpretation. 

We will formalize this notion of consistency in several steps. We 
will first define what it means for a single decision to be consistent 
with an interpretation. This easily extends to the definition of dadep 
graphs being consistent with an interpretation. It will then be 
possible to define what we mean by sequences, even infinite ones, being 
consistent with an interpretation. We can then circumvent interpretations 
and define consistency between decisions, dadep graphs and sequences. 
Throughout, consistency has the flavor of "agreeing on the outcome of 


identical decisions", 
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6.5.2 Consistency - Formal Definition 

Suppose we have a dadep graph D and a decision d in D. Suppose d is 
labeled with predicate name p and one of the outcome labels, T or F. 
Then decision d is said to be tnconststent with interpretation I if 
pe on the elements assigned to the input values of d has the outcome 
opposite that with which dis labeled. In any other case, including the 
case where d has no outcome label, d is said to be conststent with I. 

A dadep graph D is said to be conststent with an interpretation I if 
each decision in D is consistent with I. D is tnconsistent with I if 
any decision in D is inconsistent with I. 

We have defined dadep graphs only for finite, well-defined sequences 
over a given data flow graph. Although it would be possible to extend 
the definition to infinite well-defined sequences, we are not as 
interested in the details of such infinite sequences as we are in the 
fact that they never terminate. To handle (potentially infinite) 
sequences, then, we make the following definition. 

A well-defined sequence is conststent with an interpretation I if 
the dadep graph of every prefix of the sequence is consistent with I. 
Otherwise, the sequence is said to be tnconststent with I. 

A control sequence consistent with an interpretation I can be thought 
of as a computation which might be observed for the schema under 
interpretation I. In this spirit, we call them I-computattions. 
then 


If dq, is a decision in dadep graph D, and d, a decision in D 


1 22 


we define qd, to be conststent with d, if there is some interpretation 


with which both qd, and d, are consistent. Otherwise we say dq is 


tneonststent with dj. From our knowledge of similarity and 
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interpretations, we know that dy and d, are inconsistent if and only if 


they are similar and are labeled with opposite outcomes. 

A dadep graph is said to be (self-Jconsistent if every decision in the 
graph is consistent with every other decision in the graph. It is not 
difficult to see that a dadep graph is consistent if and only if there is 
an interpretation with which it is consistent. 

Dadep graph D, is defined to be conststent with dadep graph D, if each 


is self-consistent and each decision in Dd, is consistent with every 


decision in D+ (It follows automatically that each decision in Dy is 


then consistent with every decision in D,.) Because we required that each 


1° 


dadep graph be self-consistent, it follows that D, is consistent with Dy 


Chaving the same number of initial values) if and only if there is some 
interpretation with which both are consistent. If Dd and D, are not 
consistent we say they are inconsistent. 

A control sequence is (self-)consistent if the dadep graph of. every 


prefix of the sequence is self-consistent. Control sequence zy af schema 


S, is conststent with control sequence x 


1 of S, Chaving the same number 


2 


of schema inputs) if both are self-consistent and the dadep graph of every 
prefix of zy is consistent with the dadep graph of each prefix of Zoe 
Therefore, two control sequences are consistent if and only if there is 


some interpretation with which both are consistent, 


77 


6.5.3 Consistency - Some Perspective 

In sequential schemata such as Paterson's flow chart schemata (which we 
henceforth simply call Paterson schemata to avoid confusion with data flow 
graphs), an interpretation completely determines what will occur. Only at 
decider outcomes is there any "choice" in a sequential schema, and the 
outcome of all decisions is fixed by an interpretation. 

When we move to parallel schemata, the choice of interpretation no 
longer determines which event must occur at each step in a computation. 
For example, in the Petri net model of Figure 3.4, the first event to occur 
might be w, a, or b, no matter what interpretation is involved. (Which of 
these events actually occurs first might, for some particular implementa- 
tion, depend upon such things as the availability of a suitable processor 
or the details of a scheduling algorithm. These issues are not dealt with 
by our model: We are only concerned that the control set might allow any 
of several events to occur.) Although an interpretation specifies the 
outcomes of all decisions, for parallel schemata this no longer uniquely 
determines a control sequence. The choice of an interpretation I only 
restricts control sequences to that subset consistent with I. Within this 
set of I-computations, any sequence might be observed for the schema under 
interpretation I. 

As mentioned, the self-consistent sequences are precisely those which 
are consistent with some interpretation. This subset of CONTROL is 
sufficiently important that we give it a name, EXECUTION SEQUENCES. A 
control sequence which is not an execution sequence is, in one sense, 
uninteresting since it cannot be observed under any interpretation. It 


may not be possible to uniformly eliminate inconsistent sequences, however. 
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6.6.1 Equivalence of Sequences - Formal Definition 

We can now define a meaningful comparison between control sequences 
of the same or differing schemata. Our definition of equivalent 
sequences will amount to producing the same, if any, outputs for all 
interpretations. We find it useful not to insist that equivalent 
sequences be consistent. 


Formally, suppose we have control sequence x, of schema Sy) having m 


1 1 
inputs as < 
schema inputs and n, schema outputs, {Rj oky5> Han! Suppose also 
that ZL, is a control sequence of schema So having Ms inputs and No outputs 


{25 boo» Pan Then #, and x, are (output) equtvatent if 


1) m,=m, and nj)=n,, and 


2a) both Ly and “, are infinite, or 


2b) both zy and Zo are finite, and, for all interpretations I, and for 


all lsi<n,, the element assigned by I to the value with memory label 


1? 
aa in the dadep graph of xy is the same as the element assigned to 


the value with label Ros in the dadep graph of Ly: 


An immediate corollary of Theorem 6.1 is that clause 2b) can be 
replaced with 
2b') both x and x, are finite, and, for all isisn,, the value with label 


&,. in the dadep graph of xy is similar to the value with label Lo: 


li 
in the dadep graph of Zos 
We will generally use this second form which makes no explicit 
mention of interpretations. We write x45, to denote that a, is 


equivalent to Xo: 


GsTed Determinacy and Equivalence of Schemata - Formal Definition 
A schema S is determinate if, for all interpretations I, all 
I-computations are equivalent sequences. Alternatively, schema S is 

determinate if, for all control sequences xy and x, such that x, is 


2 ‘lh 


consistent with Boy Ly and ZR, are equivalent sequences. That the two 


definitions are equivalent follows from the fact that x and x, are 
consistent if and only if there is an interpretation T such that both 
are I-computations. 

This form of determinacy is sometimes calles. output deteruaey or 
output funetionaltty, since it deals only with the final contents of 
output locations. Since it is the only form of determinacy we will 
treat, we will use the simpler torn. 


Two schemata S, and S, are equivalent if, for all interpretations I, 


2 


4 


if « is an I-computation of one schema, the: there exists an equivalent 
t-computation y of the other schema. Again, the definition can be 
restated so that interpretatiors ‘o not appear. In particular, Sy and Si 
are equivalent if, for each execution sequence x of one schema, there is 
a consistent, equivalent execution sequence y of the other. 

Note that as stated, schemata need not be determinate for equivalence 
to apply. It is an easy exercise to show that two schemata are 
equivalent only if both are determinate or both are non-determinate. 

For determinate schemata, other definitions of equivalence are also 
seen in the literature. We can define two determinate schemata to be 
weakcy equivalent if, for all interpretations I, if both schemata have 


finite I-computations, then these sequences are equivalent. There is no 


natural analogue to weak equivalence among non-determinate schemata. 
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6.8.1 Schemata - Some Perspective 

Schemata as we have defined them have been the result of taking the 
common notion of a computation and splitting it into three parts. One 
part, the data flow graph, specified structural relationships between 
memory locations and actors, and indicated where inputs and outputs were 
expected. Another part, the interpretation, supplied the detailed 
information about the potential contents of memory locations and the 
effect of actors on these contents. The third part, the control set, 
specified allowable sequences of actor initiations and terminations. 

Dadep graphs were defined as a representation of a sequence and a 
data flow graph. We showed the "universality" of free interpretations, 
and the relationship of similarity in dadep graphs to equality in free 
interpretations. This enabled us to show that the structure of dadep 
graphs also includes "all we need to know" about interpretations. That 
is, the definitions of schemata determinacy and equivalence obtained by 
quantifying over all interpretations can also be stated in terms of 
dadep graph structure without explicit mention of interpretations. 

Thus dadep graphs constitute a powerful investigative tool combining 
all three of the parts mentioned above. They will find considerable use 


in the remainder of this dissertation. 
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Toled Schematology ~ Some Pitfalls 

It has long been known that the equivalence problem for Turing machines 
is not decidable!9,14, Since one can encode a Turing machine in most 
interpreted models for computation!?, the equivalence problem for these 
models is also undecidable. An early hope for schematology was that by 
demanding equivalence for all interpretations, these encoding tricks could 
he circumvented and the schemata equivalence problem found solvable. 

Unfortunately, the equivalence problem for Paterson schemata has been 
shown to be undecidable?>!!. Since the schemata we have defined can mimic 
Paterson schemata, it follows that the general equivalence problem for our 
schemata must also be unsolvable. 

As a further consequence, we can show that the general determinacy 
problem is also undecidable. To see this, suppose we have two m-input 


n-output Paterson schemata Sy and S, whose equivalence we would like to 


2 
test. We can assume that ACTORS, NACTORS,=$, since actors can be relabeled 
without altering function names or predicate names. We can then form a 
composite schema by merging the schema inputs and schema outputs of the 
data flow graphs as shown in Figure 7.1, and letting the new control set 
be CONTROL, UCONTROL, . 

It is a property of Paterson schemata that each interpretation I 
defines exactly one I-computation. Thus, our composite schema would have 
exactly two I-computations, one from Sy and one from Sys The composite 


schema will be determinate, then, if and only if S, and S, are equivalent. 


1 


Since the latter is undecidable, so is the former. 
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Figure 7.la Data flow graph of Sy Figure 7.1b Data flow graph of S, 


Figure 7.1c Data flow graph of composite schema 
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7.2.1 Properties of CONTROL - Introduction 

Knowing that the general equivalence and determinacy problems are 
undecidable, we will concentrate our efforts on identifying classes of 
schemata where the problems are more tractable. We define these classes 
by putting "syntactic" restrictions on control sets. So far, the only 
restrictions on control sets of schemata are that the sequences be well- 
cafined and that finite control sequences leave results in the schema 
output locations. As we shall see, this allows control sets which run 
counter to our intuition about "proper" control. After developing some 
familiarity with control sets, we will begin restricting them to obtain 
more ponsonabae classes. 

If o is an event and x is a prefix of a control sequence, then we say 
that o is enabled after prefix x if xo is also a prefix of a control 
sequence. One must be careful not to read more into this definition than 
is really there. Without further details about the mechanism which 
determines the control set, all we can conclude is that after x has 
occurred, there is nothing to prevent o from happening next. Of course, 
xo may not be consistent with a given interpretation, or, for that matter, 
with any interpretation. In parallel schemata, we generally expect that 
many events may be enabled after a given prefix. 

If a control mechanism allows two events o7 and On to occur 
"simultaneously" after prefix x, this will be manifested in the existence 


of prefixes xo and x00 The converse is not strictly true. Figure 


ig pale 
7.2 shows two Petri net controls each of which allows sequences 


{5,02 5750,}. Only in the first can the events be truly simultaneous. 


To us, however, the distinction is not an important one. 


o & @) 
iL 


} 


Figure 7.2 Two Petri nets with CONTROL = {0,0,500, 


> 
g 


Figure 7.3 A Petri net control specification which is not commutative 


85 


7.3.1 CONTROL - The Prefix Property 

One property we might expect of a reasonable control mechanism is 
that it be able to detect when a computation had completed. Since we 
are dealing with speed-independent systems, we must rely on the control 
mechanism to amnounce when things have come to a halt and results are 
ready in the schema outputs. 

The prefix property guarantees that when a computation terminates, 
no further activity is possible. Stated formally, a control set has the 
prefix property if no control sequence is the proper prefix of any other 
control sequence. In other words, after a finite control sequence, 
nothing is enabled. 

When dealing with a particular control mechanism, the control 
sequences are usually so defined as to have the prefix property 
automatically. This bears out our comment about the set-theoretic 
approach appearing to formalize the trivial. However, as the single 
state control mechanism of Figure 3.8 shows, what is obvious in some 


mechanisms may be absent in others. 


7.4.1 CONTROL - Persistence 

In the proof of the undecidability of determinacy, the control set 
of the composite schema had some peculiar properties. The compcsite 
control set was formed by taking the union of two control sets of 
sequential schemata whose alphabets of events were presumed disjoint. 
Initially, then, there would be two events enabled, corresponding to the 
first event from each of the component schemata. As soon as an event 


occurs, however, one of the component schemata becomes "irrelevant" 
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since it has no control sequences starting with that event. The event 
originally enabled in this component schema ceases to be enabled. Note 
that this happens, not because of an inconsistency with an interpretation, 
but because “something clicks" in the control set and precludes any 
further activity. 

If the reader feels that something is amiss here, it is probably 
because most of the familiar models for parallel computation have a 
property we call persistence which is lacking in the composite schema. In 
a persistent schema, once an event is enabled, it remains enabled until it 
occurs, or, in the case of a decider termination, the termination with 
opposite outcome occurs. 

Formally, a schema is persistent if, given any prefixes IO, and m0, of 
and g, and o, are not opposite 


3 lL 2 


terminations of the same decider, then IO 55, is also the prefix of a 


control sequences such that a 


control sequence. (By symmetry, TO5o) would also be a prefix.) 

Sequential schemata are trivially persistent since the only distinct 
prefixes of the form xa, and xO, are those for which a, and o, are 
opposite terminations of a decider. Fork/join formalisms are also 
persistent since the loci of control are independent except at join nodes. 
Petri nets are persistent if multiple arcs out of state nodes lead only to 
alternative termination events of a decider. (This is a sufficient 
ccendition, but not a necessary one, as demonstrated in Figure 7.2.) 
Dennis-Fosseen schemata are always persistent since firing any enabled 
eleent cannot disable any other element. Finite state control mechanisms 


need not be persistent, but the property can easily be verified by 


inspecting the state graph. 
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Te del CONTROL - Commutativity 
Persistence implied that the occurrence of one event could not disable 


another event. Thus, if xO, and zo, are consistent, distinct prefixes, 


LO 55, and, reversing roles, xo o> are also prefixes. Commutativtty will 


2 
imply that if two events can occur in either order, as with ow, and 9, after 
prefix x above, then the actual order of occurrence is unimportant to the 
control mechanism. This "unimportance" can be stated in syntactic terms 

by requiring that anything which can happen after xO,0, can also happen 


after LO no,» and vice versa. 
Formally, a schema is commutative if, for all prefixes x0,0, and £050, 


of control sequences , L519 HY is a control sequence if and only if LO555Y 


ak 
is a control sequence. 
Sequential schemata are trivially commutative. Petri net and fork/join 
formalisms are not necessarily commutative. We show a Petri net 
counterexample in Figure 7.3. A fork/join counterexample would look much 
the same. Both 040,09, and T,5,0, are prefixes, but F309 59), is a prefix 


whereas 0,0,0,0, is not. The problem here is that o, in the two prefixes 


3°12 °4 dl. 


corresponds to different events in the Petri net. If events cannot be 
"enabled in parallel with themselves" as above, then fork/join controls 
and Petri nets, with branching out of state nodes restricted as in the 
last section, will be commutative. Dennis-Fosseen schemata are always 
commutative. If a finite stats machine is in re2uced form, commutativity 


and 0,0, lead to the same 


can easily be verified by checking that 91% 2° 


state from any given state in the state diagram. 
commutativity is common because is requires more "states" to recall 


the order of events than to ignore it. Thus, commutative control 
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mechanisms tend to be simpler than non-commutative ones. Furthermore, 
since sequences of the form LT, Ty and LAT, generally reflect the 


situation that ay and g, can occur simultaneously, there are practical 
(and philosophical) difficulties in actually determining the order of 


occurrence. 


7.6.1 CONTROL - Conflict 

Confltet, or the absence thereof, is a joint property of control 
sets and data flow graph topology. A conflict is said to exist if there 
are prefixes ro and xo of control sequences where either 
1) o is the initiation of some actor a such that the output location of 

operator o is an input location of actor a, or 
2) o is the termination of some operator r#o such that r and o have the 

same output location. 
In either case, we say that zo and zo are in conflict. A schema is said 
to be conflict-free if its control set has no conflicts. 

A conflict exists, then, if two events can be enabled after the same 
prefix and both are about to write into the same location or one is 
about to write into a location the other is about to read. If the reader 
senses something dangerous here, his fears are not groundless as will 


become clear in the following sections. 
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7.7.1 Determinacy - An Overview 

A schema which is persistent, commutative and conflict-free and has 
the prefix property may nevertheless exhibit properties which are quite 
unconventional. For example, it may be that many initiated actors are 
left unterminated when computations complete, or there may exist 
interpretations with which no control sequence is consistent. In view 
of such unorthodox possibilites, it might be surprising that such 
schemata are always determinate! Those familiar with the work of Karp 
and Miller will note the similarity of the proof of this fact to an 


analogous result for their schemata’. 


7.8.1 Determinacy - Some Preliminary Notions 

The rules for constructing dadep graphs are such that distinct 
sequences need not generate dadep graphs which are different from one 
another. To make this statement precise, we must define what we mean by 
two sequences generating the "same" dadep graph. 

Suppose x and y are finite, well-defined sequences over a data flow 
graph G. Let D. and > be the dadep graphs of x and y, respectively. 
Then x and y are dadep tndistingutshable if the following four conditions 


hold. 


1) For each event o in the alphabet of events of G, the number of 


occurrences of o in x equals the number of occurrences of o in y. 


2) If there is a 4th occurrence of uninterpreted actor a in x, then the 
, F ‘ .th sess 
action in D_ corresponding to the j occurrence of a is similar to 
x 


. : : a wth R 
the action in D corresponding to the j occurrence of a, and, if a 


90 
is a decider, both decisions have the same outcome label, if any. 


3) If there is a qth occurrence of identity operator o in x, then the 
value in D. corresponding to the 4th occurrence of o is similar to 


the value in D corresponding to the 4th occurrence of o. 


4) If 2 is a memory location label on a value in Di» then & is a label 


on a similar value in vs 


Condition 1 simply means that the dadep indistinguishable sequences are 
made up of the same events, possibly occurring in a different order. 
Conditions 2, 3, and 4 together imply that the differences in order do not 
cause any Significant differences in the values, actions, or memory 
labeling in the dadep graphs. 

For the data flow graph of Figure 2.4, each pair of sequences in the 
following set are dadep indistinguishable, 

{ wwaabh , wawabb , waawbb , waabwb , waabbw , wwabab , wawbab } 
A little combinatorial mathematics reveals that N operators in parallel 
can occur in (2N)! / (Ny distinct sequences. Thus, the three operators, 
w, a, and b, determine 90 such dadep indistinguishable sequences, a 
considerable gain in flexibility over the single sequence of sequential 
schemata. 

If two execution sequences are dadep indistinguishable, condition 4 
immediately guarantees that the sequences are equivalent, and condition 2 


ensures that they are consistent. 
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Equally as important, if x2 and yz are finite control sequences and 
x is dadep indistinguishable from y, then x2 is dadep indistinguishable 
from y2. This follows by induction from the fact that xo and yo are 
dadep indistinguishable if x and y are. This, in turn, can easily be 
verified by considering the five cases involved in the construction of 
dadep graphs. We will carry out the proof here, but will avoid such 


detailed expositions in the future. 


Lemma 7.1: If x and y are dadep indistinguishable sequences and x0 


is well-defined, then xo and yo are dadep indistinguishable. 


Proof: Condition 1 of dadep indistinguishability for xo and yo 
follows imusdiately from Condition 1 applied to x and y. If o is the 
termination of some decider d, let j be the number of occurrences of 
terminations of d in xo. Otherwise, let j be the number of occurrences 
of o in xa. Let dD. and DY be the dadep graphs of x and y, respectively. 


We consider the following cases. 


Case 1) o is the initiation of an actor c other than an identity 
operator. By condition 4, the values in dD. which bear the names of 
the input locations to c are similar to the values bearing these 
labels in De Condition 2 holds for the new action, and conditions 


3 and 4 are unaffected. 


Case 2) o is the initiation of an identity operator c. By condition 4, 
the values in D. and D to which this 5” occurrence of c corresponds 
are similar. Thus condition 3 holds, and conditions 2 and 4 are 


unaffected. 
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Case 3) go is the termination of an operator c which is not an identity 
operator. By condition 2, the actions in D.. and Dy corresponding to 
the ath occurrence of c are similar. Hence, the values added are 
similar, so the values labeled with the name of the output location 
of c in the dadep graphs of xo and yo are similar. This preserves 


condition 4, and conditions ? and 3 are unaffected. 


Case 4) o@ is the termination of a decider c. By condition 2, the 
mes é q .t 
decisions in D,. and Dy corresponding to the j ? occurrence of c are 
similar, and here are given the same outcome label. This preserves 


condition 2 and conditions 3 and -4 are unaffected. 


Case 5) o is the termination of identity operator c. By condition 3, 
the values in Ds and Dy corresponding to the 5th occurrence of c are 
Similar. Thus, the values in the dadep graphs of xo and yo bearing 
the name of the output location of c are similar, preserving 


condition 4. The other conditions: are unaffected. ; 0 


Putting this lemma together with the observation which preceded it, 


we obtain the following lemma. 


Lemma 7.2: If xz and yz are finite execution sequences and x and y 
are dadep indistinguishable, then xz and yz are consistent and 


equivalent. 
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Fe 9. A: Persistence, Commutativity and Conflict - Some Lemmas 
Persistence, commutativity and conflict all deal with well-defined 


sequences of the form LO LO, LIT, and xo,0,. We therefore derive 


BE 21 


some useful lemmas about such sequences. 


The first lemma shows that if o,#0, and o, and o, are not the 


1 1 


opposite terminations of some decider, then if xO, and xo, are well- 
defined sequences which are not in conflict, L055, and cO,0, are dadep 
indistinguishable. The proof, similar to that of Lemma 7.1, involves 
checking a number of cases against the definition of dadep graphs. We 


leave the details to interested readers and only outline the proof. 


and o, and o, are not opposite terminations 


Lemma 7.3: Suppose 9,70, 1 2 


of the same decider. If zo, and xO, are well-defined sequences which are 


not in conflict, then LO 5 o, and zO,0, are dadep indistinguishable. 
Proof: By exhausting all possible combinations of cases. 

Case 1) O,=a, Jy=d. Initiation events do not alter the labeling of 
values with memory locations. The actions are therefore attached to 
the same values independent of order. If either a or b or both are 
identity operators, no action is added, but the value corresponding 


to its occurrence is the one bearing the name of its input location. 


As mentioned, this is not altered by another initiation. 


Case 2) o,=a, 0, 


terminate some actor initiated in x. Notably, the initiation of a 


=b, b an operator. Since xb is well-defined, b must 


can have no effect on the operation to which this occurrence of b 
corresponds. After b has been added to the dadep graph, the memory 


labeling is changed. However, the change only involves the name of 


SY 


the output location of b. If this is an input location of a, xa and 
xb are in conflict. Otherwise, the change does not affect the action 


added by the occurrence of a. 


Case 3) oj 2a, g,=d, or Ty=dp. Since decider outcomes only add decision 


labels, there can be no interference between oy and Fos 
Case 4) o, 7a, o,=b, a anc b operators. If the output location of a is 
the same as the cutout location of b, xa and xb are in conflict. 


Otherwise, the order is immaterial. 


Case 5) T17Ss o,=d,, or o5=d,- As in case 3, there can be no problems. 


d,, or 0,=d_, O,=c,, or o,=e€ If d#e, the decisions referred 


Case 6) 9 )=d, se sec ae 


to must be distinct, and no problems arise. If d=e, then, since 
01*045 they must have opposite outcomes. But we also hypothesized 
that o, and a, were not opposite terminations of the same decider. 


1 2 
Thus it cannot be that dz=e. a 


The next lemma shows that persistence has some global implications. 


Lemma 7.4: In a persistent schema with the prefix property, if xo is 
the prefix of a control sequence and xy is a finite control sequence, 
then o occurs in y, or, if o is a decider termination, the opposite 
termination occurs in y. 

Proof: Since xy is a finite control sequence, y is finite and we can 


“"*a . If a, is o or the decider 


write it aS a sequence of events y=a_a 
q 4 Al n el 


termination of which o is the opposite termination, we are done. 


Otherwise, xa, and xo are prefixes which, by persistence, imply that 


1 


vaso is also a prefix of a control sequence. Repeating the argument for 


Par ees See 
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go and an» O and ag > and so forth, we must either find an a; equal to o or 


the opposite termination thereof, or else xa "*a go is the prefix of 


1°92 
some control sequence. But this would violate the prefix property, for 


xy was a finite control sequence. Thus, such an a; must be found. 0 


Lemma 7.5: In a persistent schema with the prefix property, if xo 
is the prefix of a control sequence and is consistent with finite 
execution sequence xy, then o occurs in y. 

Proof: If o is not a decider termination, this follows immediately 
from Lemma 7.4. So suppose o is the 5h termination of some decider. By 
Lemma 7.4, either o or the opposite termination must occur in y. If the 
opposite termination occurred first, then xo and xy would be inconsistent 
about the outcome of the qh occurrence of the decider. Therefore, o 


must be the next termination of the decider to occur. oO 
The next lemma shows that in a persistent, commutative schema, an 
event which did not occur when it was first enabled can “slide back". 


Lemma 7.6: In a persistent, commutative schema, if xo is the prefix 


of a control sequence and xyoz is a control sequence where neither o nor, 


in the case that o is a decider termination event, the opposite termination 


event, occurs in y, then xoyz is also a control sequence. 


Proof: Suppose yRay “a> where each ay is an event. By 


a? 
2 
hypothesis, no a, is o or the opposite termination event if 0 is a 


decider termination. By the same argument used in Lemma 7.4, rao, 


©a,a,0, and so on through ®a,ay**a, 47 are all prefixes of control 


sequences. Persistence then implies that va,a,’*' 


prefix of a control sequence. Since the schema is commutative, 


a o a is also the 
n-1 n 
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wa,a, "a, _49a.2 is also a control sequence. Repeating the argument n-1l 


times, we can conclude that xoyz is also a control sequence. D 


An easy corollary follows if the schema is also conflict-free. 


Lemma 7.7: In a persistent, commutative, and conflict-free schema, 
if xo is the prefix of a control sequence and xzyoz is a control sequence 
where neither o nor, in the case that o is a decider termination event, 
the opposite termination event, occurs in y, then xoyz is a control 
sequence which is dadep indistinguishable from xyoa. 


Proof: From Lemma 7.3 we know that xva,°°’a. ,a.oa. 
1 inl i “itl 


“a2 are dadep indistinguishable for l<i<n. One need 


“a 2 and 
n 


a Ptea?*aF ie 


ra 
only show that dadep indistinguishability is a transitive relation for 
the result to follow. By inspecting the definition of dadep 
indistinguishability, it is clear that it is an equivalence relation, 


hence transitive. 0 
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7.10.1 Determinacy - Some Sufficient Conditions 
We are now in a position to put some of the preceding lemmas together 


to prove the following determinacy result. 


Theorem 7.8: A persistent, commutative schema with the prefix 
property is determinate if it is conflict-free. 

Proof: Suppose = ard y are I-computations. If both are infinite, they 
are equivalent and we are done. So suppose that x is finite and no longer 


than y. Suppose L=a, ay ay and y=bjb, bo . We will find a sequence 


. 


of I-computations, Lo» Ly > caste xo. such that xs is dadep indistinguishable 


from x and the prefix of length i of xs is b,b,""b;. 


Letting x.=x, we have the basis for our inductive argument. Suppose we 


0 


have I-computation Le satisfying the above conditions. We know that 


1 
£55 bd, Bs 4SsCa ay coe Sincs b>, b,_4b; is the prefix of an 


execution sequence, ¥, which is consistent with x, and hence with Ley 


Lemma 7.5 ensures that by occurs among the ee? Suppose Cy is the first 


q wane fave ec F 
such occurrence. By Lemma 7.7, bib, Be bees cay Ca c, isa 


control sequence which is dadep indistinguishable from Le By the 


L 
transitivity of dadep indistinguishability, this sequence, which we shall 
call Zs» is also dadep indistinguishable from x. 

Now consider x =bib, bo If y is longer than x, bib, be is in 
violation of the prefix property. By hypothesis, x was no longer than y. 


Hence, LY. Since dadep indistinguishability implies equivalence, x=y, 


which is precisely what we needed to show determinacy. O 


98 


pare ears Determinacy - Retrospect 

If the reader followed the last proof carefully, he will note that 
the result is even stronger than we stated. Persistence, commutativity, 
the prefix property, and conflict-freeness not only guarantee that 
I-~computations are equivalent in our sense; they imply that all finite 
I-computations are dadep indistinguishable, mere “permutations” of one 
another. 

Persistence, commutativity, conflict-freeness and the prefix 
property are sufficiently strong that even if our model is extended to 
allow subroutine-like use of schemata by other schemata, determinacy 
is preserved. We formalize how schemata can be so extended and outline 


the proof of determinacy in Appendix I. 
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8.1.1 Conflict and Non-determinacy - Introduction 

We have seen that for persistent, commutative schemata with the prefix 
property, conflict-freeness is sufficient for determinacy. We will now 
turn to a class of schemata for which conflict-freeness is necessary for 
determinacy. We do not investigate such schemata simply "because they 
are there". Non-determinacy is generally something we wish to avoid, not 
to guarantee. By studying how a local problem such as a conflict can 
develop into a global problem of non-determinacy, however, we improve our 
understanding of how parallel systems behave. This understanding may 


help to circumvent non-determinacy in other classes of parallel systems. 
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8.2.1 Paths in Dadep Graphs - Similarity 

Paths in a dadep graph show how actions depend upon the initial 
values and output values of operations. To gain some additional 3 
familiarity with the properties of paths in dadep graphs, we establish 
a simple result about paths and similarity. 

Recall that arcs from values to actions are labeled with numbers. 
If an action is m-ary for some positive m, there are arcs into the 
action with labels 1 through m. Arcs from operations to values are 
unlabeled since there is precisely one such arc, if any. 


3& 


A path m in a dadep graph is a sequence of arcs a a 


ety? 
such that for 1<is<n, the node from which arc as emanates is the node 
to which arc Os leads. Let us agree that the ae node on path 7 is 
the node from which arc a, emanates, and the nt1t (and final) node 
on path m is the node upon which arc a, terminates. We say that ™ is 
a path from the first node of m to the last node of 1. 

We say that path TFG, 055°" 50, in dadep graph Dy is sintiar to 
path %5=B, Bos 2B. in dadep graph D, if all of the following 


eonditions hold: 
1) m=n. That is, the paths have the same length. 


2). For l<i<m, either a; and B; are both unlabeled, or both have the 
same label. It follows that two similar paths both must begin on 
value nodes, or both must begin on action nodes. 


3) For 1si<mtl, the ri node on path ™ is similar to the 7 node on 
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We can relate similar paths to similar nodes by the following 


straightforward but useful lemma. 


Lemma 8.1: Let ,=a, ,a in dadep graph 


pF > po 3b, be a path to node n 


uk 
Di: Node ny in dadep graph D, is similar to ny if and only if there is 
a path 75=B, Bo» 2B to n, in D, which is similar to 1 


Proof: Since the final nodes on similar paths are similar, one half 
of the lemma is trivial. 


Suppose ny and ny are similar. If ny is a value node, O must be 


the unlabeled arc from the operation o, of which n, is the output value. 


a 1 


Since n, is’similar to D> it must be the output value of an operation 


2 


0, Similar to o,. Let BF be the unlabeled arc from o, to Dy: 


2 HE 2 


Suppose ny is an action node. Then a, must be the arc labeled i 


from Vi» the ge input value of n 


reg input value of Ny» must be similar to vy: Let BF be the arc labeled 


i Since n, is similar to Ny» Vos the 


i from Vo to No: 


Working our way back in this manner, it is obvious that we can 


construct ™,=B,,8,,°°°, 8 similar to 1). 0) 


Two corollaries follow easily from Lemma 8.1. 


Lemma 8.2: If ny and Ny are similar nodes in dadep graphs Dy and Do» 


and if my is a node on a path to ny> then there is a node mM similar to 


m, On a path ton 


1 2° 


are dadep graphs and n, is a node in D 


Lemma 8,3: If Dy and D 1 1 


2 


which is not similar to any node in Dy» then no node on a path from ny 


is similar to any node in Dy + 
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8.3.1 Schemata - Some Additional Properties 

Since the first determinacy result did not depend upon properties 
other than those mentioned, we made no attempt to further limit control 
sets. We now discuss some properties which lead to behavior more in 
line with the systems we mentioned in the introduction. 

In a speed-independent system, one can make no assumptions about the 
length of time between the reading of an m-tuple of values and the 
completion of the processing thereon. If a control mechanism cannot 
block termination events, they must be anticipated at any time after 
the corresponding initiation event has occurred. 

To make this precise, we say that a schema has the tmmediate property 
if, whenever x is the prefix of a control sequence and either o is an 
operator such that xo is well-sequenced or d is a decider such that ad, 
and zd, are well-sequenced, then Xo or zd, and xd, are prefixes of 


control sequences. 


Sequential schemata, as we have noted, have exactly one I-computation 
for each interpretation I. We generally expect parallel schemata to have 
more than one I-computation, but we would hardly expect them to have none 
at all. However, there are data flow graphs for which {dd} satisfies 
all the conditions necessary to qualify as a control set. It is even 
persistent, commutative and conflict-free, and it has the prefix 
property. Any interpretation I which specified a false outcome for d's 
predicate would have no I-computations. 

Even the fact that a control set contains an I-computation for each 


interpretation I is not sufficient to guarantee reasonable behavior. 
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For example, {daad,,,dbbd.} could be such a control set for some data flow 
graphs. If we view the sequence of events as occurring through time, 
however, we see that once da has occurred, somehow d cannot terminate 
with outcome false. This runs counter to our intuition about how control 
mechanisms and decisions interact. 

To preclude some of this pathological behavior, we define a property 
called completeness. A schema is complete if, for every prefix x of a 
control sequence and for every interpretation I with which x is consistent 
there is an I-computation zy. In a complete schema, it is impossible to 
"pun into a dead-end" under any interpretation. 

It might seem that a schema with the immediate property would always 


be complete, for whenever xd, is a prefix, so is zd... Unfortunately, 


T 
this is not enough to guarantee completeness. For example, the control 
set d(d,aad)*d,, has the immediate property, but, having no infinite 


sequences, there is no I-computation for the interpretation in which d's 


predicate is always true. 
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8.4.1 Productivity - Overview 

We now turn to a very important notion, that of producttvtty. 
Productivity formalizes the concept of an event in a computation 
"accomplishing something useful." Initiating an actor in the course of 
a computation can be justified in a number of ways (as one can 
empirically verify by asking a programmer, "Why is that instruction 
in your program?") At the most trivial level, an actor is initiated 
to carry out a particular transformation or test. At a more 
satisfactory level, one can explain how the output of an operation 
will be used by subsequent actors, or what will happen if a decision 
comes out true. Best of all, the effect of an action on the fuout/ 
output behavior of the schema could be pointed out. From a modular 
point of view, only the last justification is truly relevant. 

We will define, in several cases, what we mean by the occurrence of 
an actor being productive in a sequence. The cases are rather involved, 
but the fundamental ideas are straightforward. For notational 
convenience, we define a schema output value to be any value in the 
dadep graph of a finite control sequence labeled with the name of some 
schema output location. In other words, a schema output value 
corresponds to the final contents of a schema output location. 

An occurrence of an operator which is not an identity operator will 
be productive in a finite control sequence if the value it produces is 
a schema output value, or influences a schema output value, or influences 
a decision in the sequence. A decision will be productive if there are 
two non-equivalent control sequences between which the decision outcome 


arbitrates. An occurrence of an identity operator in a finite sequence 
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is productive if the location to which the value is moved is a schema 
output location which is subsequently unchanged, or is the input location 
of a subsequent productive actor initiation. After formalizing these 


ideas, we will philosophize some about their meaning. 


8.4.2 Productivity - Formal Definition 

Suppose w=acy cz is a finite control sequence where the indicated 
occurrence of uninterpreted opirator c is the hae such in w. Let D be the 
dadep zraph of w, and let v be the value which is added to D when c 

. .th |. .th . F 
terminates for the Jj time. We say that the j occurrence of c in w is 
productive if v is a schema output value of D, or lies on a path toa 


schema output value of D, or lies on a path to a decision in D. 


Suppose w,=xdz_, and W, =xdz 


1 1 are control sequences where the indicated 


2 


occurrences of decider d are the a such in w, and Wy. We say that the 


1 


i occurrence of d is productive for wy and w, if w, and w, are not 


equivalent, there exist oe terminations of d in w, and Wy having 


1 

opposite outcomes, and all other decisions in wy and W, are consistent. 
Finally, suppose Wexcy cz where the indicated occurrence of identity 

operator c with output location & is the eg such inw. Let D be the 


dadep graph of w. We say that the Fi occurrence of c in w is productive 


if any or all of the following hold: 


1) & is a schema output location and no operator terminating in 2 has 
output location &. (That is, ec moves a value to a schema output 


location where it remains.) 


2) is of the form z,bz,bz., where the indicated occurrence of operator 


b is productive inw, & is an input location of b, and no operator 


LO6 


terminating in By has output location 2. (That is, the value moved 
to location 2% is subsequently used from there by a productive 


operator.) 


3) 2 is of the form 2,02, where d is a decider which has 2 as an input, 
and no operator terminating in zy has output location 2. (That is, 


the value is moved to a location from which it is used by a decision. ) 


8.4.3 Productivity - What Does it Mean? 

The definitions of productivity just given probably look mysterious 
at best, but once understood, they are quite natural. For example, our 
definition of decider productivity follows from the common-sense 
principle: Don't ask questions if you don't care about the answers. 
This translates into our requirement that under some circumstances, the 
answer to just 1 question determines which of two non-equivalent 
computations take place. An example might help to clarify this point. 


Consider the following program and tabular representation of the output. 


input ( X ) pC X ) q(x.) OUTPUT 
ci ae T oT £(X) 


then if q ( X ) then X*« £( X) else X* g(X) T F g(x) 


else if q ( X ) then X* f£( X) else X*+ g(X) FT £(X) 


output ( X ) 


ins 


Fg(X) 


From the second and third lines of the table summarizing the 
behavior of the program, it can be seen that if p(X) is true, the 
output may be g(X), and if it is false, the output may be f(X). 


However, we do not consider the occurrence of p in the sequences 
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PPrda ff and PP pdq88 productive because the sequences are not consistent 
about the decision made by q. There are no sequences for which p is 
productive which is probably in line with the reader's intuition about 
the usefulness of the test p(X) in the program. Our definition would 
find productive occurrences of q, however, which is again what one would 
expect. 

The other forms of productivity are also grounded in common sense 
principles. They can be paraphrased, "Don't generate values you aren't 
going to use!" and "Don't move a value somewhere unless you need it 


there!" 


8.5.1 Productivity of Schemata - Formal Definition 

Knowing what it means for an occurrence of an actor to be productive 
in sequences, it is easy to extend the notion of productivity to 
schemata. In fact, there are several ways in which we could extend tne 
definition, and we shall mention these briefly after our definition of 
weakly productive schemata. 

A schema is said to be weakly operator productive if, for every 
prefix xo of a control sequence, there is a finite control sequence 
xoy in which the indicated occurrence of operator o is productive. 

A schema is said to be weakly dectder productive if, for every 
prefix xd of a control sequence, there are control sequences ady, 
«dy, for which the indicated occurrence of decider d is productive. 

A schema is said to be weakly productive if it is both weakly 


operator productive and weakly decider productive. 
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8.5.2 Productivity of Schemata - Discussion 

In plain English, a schema is weakly productive if it does not 
initiate actions which cannot be useful. We have called this weak 
productivity because it is possible to impose much stronger conditions. 

For example, we could define a schema to be strongly operator 
productive if, for every prefix xo of a control sequence, the occurrence 
of o is productive in every finite control sequence LOY. 

Analogously, we could define a schema to be strongly dectder 
productive if, for every prefix xd of a control sequence, the occurrence 
of d is productive for all control sequences xdy and «dz in which the 
occurrences have opposite outcome. Or, we could require that for each 
control sequence xdy, there exists a control sequence xdz in which the 
occurrence is productive. Or, we could require that there exist a 
control sequence xdy such that the occurrence of d is productive for all 
control sequences xdz in which the termination has opposite outcome. 

Putting these variations on operator productivity and decider 
productivity together, we could obtain many classes of productive schemata. 
We will concentrate on weakly productive schemata because they are the 
largest of the classes and seem the most natural for encouraging parallel- 
ism: One can initiate actors if there is a possibility they will be useful. 

Before leaving this discussion of productivity, we would like to make 
one observation. The Karp-Miller formalism for parallel schemata, because 
of the stronger form of equivalence and the way all actors alter memory, 
are inherently "very productive". We feel that this quality contributes 
substantially to the cleanness of their mathematical results, We pay for 


greater generality with more complicated proofs. 
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8.6.1 Schemata - Repetitions 

The properties we have previously defined were ones that we found 
natural or desirable for a schema to possess. The next two properties 
are introduced less because they are natural than for the reason that 
they make the study of schemata more tractable. For example, the 
absence of these properties is intimately involved in Paterson's proof 
of the undecidability of the equivalence problem. The properties 
involve the notion of doing the same thing more than once in the course 
of a computation. 

A sequence x is said to be free if no dadep graph of a prefix of x 
contains distinct, similar decisions. A sequence x is said to be 
itberal if no dadep graph of a prefix of x contains distinct, similar 
operations. A sequence is said to be repettitton-free if it is both 
free and liberal. 

A schema is said to be free [ltberal, repetttton-free] if all of 
its control sequences are free [liberal, repetition-free.] One nice 
feature of free schemata is that all control sequences are execution 
sequences. This is easily seen if one recalls that a control sequence 
which does not contain similar decisions with opposite outcomes is an 
execution sequence. Since free sequences do not contain similar 
decisions at all, they cannot have any with opposite outcomes. When 
dealing with free schemata, then, we will use the terms control sequence 


and execution sequence interchangeably. 
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8.7.1 Weakly Productive Schemata A Preliminary Result 


As an example of the intuitive appeal of weakly productive schemata, we 
prove a straightforward theorem. The proof will introduce a useful proof 
technique for productive schemata. 

Suppose we define a reduced scnema to be a schema for which every 
actor in the data flow graph is used in at least one control sequence. We 
will show that two free, determinate, reduced schemata cannot be equivalent 


if they employ substantially different functions or predicates. 


Theorem 8.4: Let S|, and S, be free, weakly productive, reduced, 


zs 2 
determinate schemata. Sy is equivalent to 5, only if PREDICATE NAMES, = 
PREDICATE NAMES, and, except possibly for the identity functic: name, 


2 


FUNCTION NAMES | = FUNCTION NAMES, 


Proof: Suppose uninterpreted operator o of Sy has function name f 


and £ ¢ FUNCTION NAMES. Sy is reduced, so there is a control sequence in 


which o occurs, say “oy. Since Sy is weakly productive, there is a finite 
control sequence xos in which the occurrence of o is productive. Because 


Sy is free, xoz is an execution sequence. Suppose the operation oy which 


corresponds to the indicated occurrence of o lies or a path to a schema 


output value in the dadep graph of wos. Since Oy has function name f, no 


operation in a dadep graph of a control sequence from S, could be similar 


to o,. By Lemma 8.3, it follows that no dadep graph of a control sequence 


of S, could contain a schema output value similar to that to which Oo, 


leads, tence, no sequence from Sy, could be equivalent to noes 


Since ©, cannot lie on a path to a schema output value if the 


schemata are to be equivalent, it must be that o, lies on a path to a 


1 
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decision dy in the dadep graph of xoz. We can write xoz as woz, 42,5 
where the action corresponding to the indicated occurrence of d is d,- 
By the weak productivity and freeness of Si» there exist execution 


sequences xoz, dv, and woz, dv, for which the occurrence of d is productive. 
By definition, the only inconsistent decisions in these sequences are the 


indicated occurrences of d. Therefore, there is an interpretation I, in 


which the indicated occurrence of d is true and which is consistent with 


all other decisions in both sequences. Let I, be identical to I 


F 7 except 


at the outcome of d. One of the sequences is an I,-computation, and the 


T 


other is an I,-computation. 


If So has no I,,-computations , then Si and S. are obviously not 


equivalent schemata. So suppose w is an T,,-computation of Sy. We 


claim that w is also an I, computation, for, if not, it must contain a 


decision similar to dy having true outcome. But dy lies on a path from 


O71 and no dadep graph of a sequence from S5 can contain an operation 
similar to 1° By Lemma 8.3, neither can it contain a decision similar 
to d 


1 


Thus, w is both an I,-computation and an I_-computation. Since 


T E 


woz, dv, and xoz, dv, were not equivalent, w is equivalent to at most 


one of them. It is therefore consistent with but not equivalent to the 


other, so S_ and So cannot be determinate and equivalent. 


1 


Exactly the same argument applies if there is a predicate name in 


PREDICATE NAMES, ~ PREDICATE NAMES... 0 
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8.8.1 Productivity - Undecidability 
The reader may have foreseen that productivity is "too good to be 
decidable." In fact, the proof that productivity is not generally 
decidable follows easily from the unsolvability of the equivalence problem. 
Suppose we take two m-input n-output Paterson schemata whose 
equivalence we would like to determine. As in the proof of the 
undecidability of the determinacy question, we can assume ACTORS, and 


ACTORS, are disjoint. Since the schemata have the same number of input 


locations and output locations, we can form, as before, a composite data 
flow graph by identifying the input locations of the schemata and the 
output locations of the schemata. 

Let p be a unary predicate name which is not in PREDICATE NAMES, U 


PREDICATE NAMES,, and let f be a O-ary function name which is not in 


2? 


FUNCTION NAMES) u FUNCTION NAMES,,. Add to the composite data flow graph 


a memory location which is the output location of a new operator o with 
function name f, and the input location of a new decider d with predicate 
name p. Form the composite CONTROL by prefacing each sequence in CONTROL, 
with oodd,, and each sequence in CONTROL, with oodd... We claim d is 


productive if and only if S, and S, are not equivalent. This is true 


1 


because d is productive if and only if there exist sequences oodd and 


nap 
oodd 2, which are not equivalent but are consistent except for the outcome 


of d. This obviously means that ay and z, are consistent, non-equivalent 


sequences, that is, S, and S, are not equivalent. 


1 
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8.9.1 Determinacy and Conflict - Some Additional Results 

For the remainder of this chapter, we will investigate a class of 
schemata for which conflict-freeness is necessary for determinacy. We 
are not suggesting that the properties which cause this problem with 
determinacy are desirable. On the contrary, we are much more concerned 
with ensuring determinacy than with precluding it. By understanding how 
conflict leads to non-determinacy, however, we are in a better position 
to avoid non-determinacy in schemata. 

We begin by establishing a number of lemmas concerning repetition-free 
schemata in which there are no identity operators. Let us fix some such 
schema, S. For notational convenience, we will use DADEP(x) to denote 
the dadep graph which is determined by the data flow graph of S and a 
well-defined sequence x. If LO Yo, is a sequence which is well-defined 


for S, then we say that event o, tnfluences event o, if there is a path 


1 
in DADEP (xo Yo 4) from the action corresponding to the indicated occurrence 
of g, to the action corresponding to the indicated occurrence of Tos 

The lemmas which follow share a great deal of notation. We shall 
introduce the common notations here to avoid the necessity of repeating 
the definitions with each lemma. 

Let xa and xo be prefixes of control sequences of S such that za and 
wo are in conflict. Let w  =aoyb and w=woayb be prefixes of control 


sequences of S. Let a, be the action in DADEP (w, ) corresponding to the 


Al 
indicated occurrence of a, and let a, be the action in DADEP(w, ) which 
corresponds to the indicated occurrence of a in Wo. Similarly, let bi 


and b, be the actions in DADEP (wy, ) and DADEP(w,,) , respectively, which 


correspond to the indicated occurrences of b. 
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We shall prove the following lemmas. 


Lemma §&.5: a, is not similar to ay: 


Lemma 8.6: b is influenced by a in w, iff b is influenced by a in w 


L 2° 


Lemma 8.7: b, is similar to b, iff a does not influence b in Wy 


Lemma 8.8: If S is commutative, there is no prefix xoaz of a control 
sequence of S such that DADEP (xoaz) contains an action similar to a> and 
no prefix xaoz of a control sequence such that DADEP(xaoz) contains an 


action similar to a- 


Lemma 8.9: If xaoz and xoaz are control sequences of S, they are 


consistent. 


Proof of Lemma 8.5 (a, is not similar to ay-): 

Since xa and xo are in conflict, m, the output location of operator o, 
is an input location, say the Phan of actor a. Because xa is well- 
defined, m must be the label on some value v in DADEP(r). Since xo is 
well-defined, there must be an operation o' in DADEP(x)} to which this 
termination of o corresponds. In DADEP(xo), a value v' is added as the 
output value of o', and label m is removed from v to become the label’ on 
v'. If v and v' were similar, v would have to be the output value of an 
operation similar to o'. But this operation and o' would be distinct, 
similar operations, contrary to the hypothesis that S$ is repetition-free. 
Therefore, the he input value of a, in DADEP(za) is not similar to the 


1 


th . 2 - — 
k~ input value of a, in DADEPGroa). It follows that a) and a, cannot be 


similar. Oo 
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Proof of Lemma 8.6 (b is influenced by a in w, iff b is influenced by 


ain Woe): 


Note that except for the order of occurrence of a and QO, Ww, and W, are 


identical. If the indicated occurrence of a in Ws is not terminated, then 


neither is the occurrence of a in Wo » and b is uninfluenced in both 


sequences. If y=c nee and cy is the termination of a in w,, then C; 


1°32 1? 


is also the termination of a in Wa Tt is easy to see that the events 


influenced by C; in w, are the same as those influenced by Cy in Wr» 


zh 


from which the lemma follows. oO 


Proof of Lemma 8.7 (b, is similar to by iff a does not influence 5 


in W,.): 
in w) ) 


We split the proof into halves. In both cases, we assume the 


contrary and derive a contradiction. 


Suppose by is similar to b, and there is a path from ay to by: We 


can assume that y=c — is the shortest sequence after which such 


162 
similar, influenced initiations occur. The nodes on the path in DADEP(w, ) 


from a, to by all correspond to events occurring in y. Let & be the 


last operation termination to which a node on the path corresponds. 
(There must be at least one operator termination in y since a must 


.) It follows that b, takes as an 


terminate if there is a path from a 1 


1 


input value the output value of the operation which . terminates. It is 


also true that by takes as an input value the output value of the operation 


which a termimates in w,. Since by and b, are similar, these operations 


2 2 


must be similar. By Lemma 8.5, we know that ay and a, are not similar, 


so ee cannot be the termination of the occurrences of a in Wy and Ws 
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It follows that es must be the termination of an operation initiated in y, 
say by event Cy. Both ey and c; are influenced by a in Wy» and we have 
concluded that the operation corresponding to Cc; in DADEP(w, ) is similar 
to the operation corresponding to Cs in DADEP(w,). This contradicts our 
hypothesis that y was the shortest sequence after which such initiations 
could occur. We can conclude that if by and by are similar, then b is 


not influenced by a in W)- 


To obtain the other half of the lemma, assume that b, and bo are not 


1 


similar, that there is no path from a, to b, in DADEP(w, ), and that 


y=c, Cc," c. is the shortest sequence after which these conditions can 
arise. If b were a O-ary operator, by and by would be similar, so we 
can assume that b has at least one input location. The function names 
or predicate names on actions by and b, are, of course, the same, so it 
eth . . ae th . 

must be that the i” input value of b is dissimilar to the i input 

' value of by for some i. Let m be the rae input location of b. If no 
operator terminating in w, has output location m, then m must be a 


th 


schema input location, say the ae and the i'” input value of by is 


contains exactly the same termination 


events as does W1> SO the ie input value of b, would also be the are 


the ath initial value. “Sequence Ww, 
initial value, contrary to the assumption that the ae input values of b 
and bo are dissimilar. So some operator terminating in wy has output 
location m. Suppose the co occurrence of operator c in wy is the last 
operator with output location m to terminate in Wy: It follows that the 
h 


output value of the operation in DADEP (a, ) corresponding to the xt 


occurrence of c is the ies input value of b It also follows that the 


i. 
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output value of the operation in DADEP (w, } corresponding to the ia 


: th . : : : 
occurrence of c is the i” input value of bos Since the ih input 
values are dissimilar, so must be the operations. We now show that no 


matter where the oe initiations of c take place in w, and Wy» some 


1 
hypothesis is contradicted. 


Case 1): The as initiation of c in w, occurs in prefix x. The , th 


initiation of c in W must also occur in x, and the corresponding 


operations would be similar, contrary to hypothesis. 


Case 2): The th initiation of c in wy is the indicated occurrence 


of a. This contradicts the assumption that there is no path from a, to bi: 


Case 3): The co initiation of c in wy Cand Wo) is event c 


It follows that y is not the shortest sequence after which the hypothesized 


h in y. 
conditions can arise. 
We conclude that if by and b, are dissimilar, then b is influenced by 


ainw,. 0 


Proof of Lemma 8.8 (If S is commutative, there is no prefix xoaz of a 
control sequence of S such that DADEP (zoaz) contains an action similar to 
ay» and no prefix vaoz of a control sequence such that DADEP(xaoz) 
contains an action similar to ays): 

We prove only the first half of the lemma. The second half follows 
by similar arguments. We assume that there is a prefix goa of a control 
sequence of S such that DADEP(xoaz ) contains an action a, similar to ay: 


We snow that ro matter where the initiation of the action occurs, we can 


aerive a contradiction. 
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Case 1): The initiation event to which a, corresponds occurs in prefix 


x. Then xa is the prefix of a control sequence containing a repetition, 
contrary to hypothesis. 


Case 2); The initiation event to which a, corresponds is the indicated 


occurrence of a in xoaz. By Lemma 8.5, ay is not similar to ays 


Case 3): The initiation event to which a, corresponds takes place 


after prefix xoa. We can assume that ay corresponds to the initiation of 


b in xoayb . Let us hereafter refer to action a, as action b, so we can 


make use of the notation we have used in the previous lemmas, By 


assumption, by is similar to a,. Since S is commutative, w 


1 =xaoyb is also 


1 


the prefix of a control sequence. 


Case 3a): b is not influenced by a inw,. Lemma 8.7 implies that b 


1 


is similar to bo. Since bo is similar to ay» by must be similar to a 


1 


1? 
contrary to the hypothesis tbat S is repetition-free. 


Case 3b): b is influenced by a in w There is therefore a path from 


3° 
ay to by in DADEP(w, ). By Lemma 8.6, there must also be a path from ao to 


b, in DADEP(w,,). Since b, is similar to ay» it follows from Lemma 8.2 


that there is an action similar to a, on a path toa 


5 in DADEP(w, ). 


1 
Actions on a path to ay» however, would have to correspond to events 


occurring in prefix x. Prefix xoa would therefore contain a repetition 


of action a,» contrary to hypothesis. 


It follows that no action in DADEP (aroaz ) can be similar to aye 
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Proof of Lemma 8.9 (If xaoz and xoaz are control sequences of S, they 

are consistent.): 
‘ ies .th 

We assume the contrary and derive a contradiction. Suppose the j 
occurrence of decider b in xaoz is inconsistent with some decision in roaz. 

Case 1); The ach initiation of b in xaoz takes place in prefix x. 

oe x . ae, ate woist ay. Ode : = 
The decision corresponding to the j " initiation of b in xoaz would then 
be similar and would have the same outcome. No other decision in xoaz 
could be similar unless there were a repetition, so no inconsistency is 
possible. 
eth soho . = . ere 

Case 2): The j initiation of b in xaoz is the indicated occurrence 
of a. By Lemma 8.8, there is no decision in xoaz which is similar. 
Therefore, there is no inconsistent decision in xoaz. 

Case 3): The “ao initiation of b in xraoZz takes place after prefix 


: = Ale ere .th 
xao. Let us write xaoz as raoyby' to indicate the j occurrence of b 


and to match our previous notations. 

Case 3a); b is not influenced by ain zaoyby'. By Lemma 8.7, the 
decisions corresponding to the occurrences of b in saoyby ' and xoayby' 
are similar and they have the same outcome. As in the first case, no 
inconsistency can arise unless there is a repetition. 

Case 3b): b is influenced by a in waoyby'. This means there is a 


path from a, to b, in DADEP(xaoyb). By Lemma 8.8, no prefix of xoaz has 


1 


a dadep graph containing an action similar to a By Lemma 8.3, there 


Ll 
can bs so decision in xoaz similar to b> hence no decision inconsistent 


with it. 


We conclude that waogz and xoaz are consistent. ia 
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With these lemmas to build upon, we can now show how one kind of 


conflict leads to non-determinacy. 


Theorem 8.10: Let xa and xo be prefixes of a complete, persistent, 
commutative, repetition-free, weakly productive schema with no identity 
operators. If xa and xo are in conflict, then the schema is not 


determinate. 


Proof: Suppose xa and xo are in conflict. We will show that 
consistent, non-equivalent execution sequences exist, demonstrating 
that the schema is not determinate. By the persistence of the schema, 
toa is a prefix of a control sequence. 

Case 1): Actor a is an operator. Since the schema is weakly 
productive, there is a control sequence xoaz in which the occurrence of 
a is productive. 

Case la); msoaz is finite and there is a path in DADEP(xoaz ) from 
the action ay corresponding to the indicated occurrence of a to some 
schema output value, say that value labeled by schema output location nm. 
Since the schema is commutative, xaoz is a control sequence and, by Lemma 
8.9, it is consistent with xoaz. By Lemma 8,8, there is no operation in 


DADEP(xaoz) similar to a By Lemma 8.3, no value in DADEP (araoz ) is 


> 
similar to the value in DADEP (xoaz ) with label m. Thus, xwaoz and xoaz 


cannot be equivalent. Since the schema is repetition-free, these are 


consistent, non-equivalent execution sequences. 


21. 


Case 1b): wxoaz can be written xoayby' where the indicated initiation 
of decider b is influenced by a. Appealing the the weak productivity of 
the schema, there are control sequences v  =woaybe , and v ,=oay bz, for 
which the occurrence of b is productive. Let I, be any interpretation in 
which the decision corresponding to the indicated occurrence of b is true, 
"Sea which is consistent with all other decisions in v, and v,. Let I, be 


identical to I,, except at the outcome of the decision corresponding to b 


T 


which I, specifies as false. Since the schema is complete and rao is 


consistent with I,, there is an I,,-computation sxaou. We claim xaou is 


Tp 
also an T,-computation , for, by Lemma 8.8, the dadep graph of any prefix 
of xaou can contain no operations similar to ay: Since ay is ona path to 
the only decision about whose outcome I, and I. differ, it follows from 
Lemma 8.3 that there is no similar decision in xaou. Since vy and v, are 


not equivalent, xaou is equivalent to at most one of them, but is consis- 


tent with both. Therefore there is a non-determinacy. 


Case 2): Actor a is a decider. This is virtually identical to case 


lb, so we omit the proof. O 
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8.9.2 Determinacy and Conflict - Continued 

Prefixes xa and xb of control sequences were also said to be in 
conflict if a and b were distinct operators with the same output location. 
We will show that this kind of conflict can also cause non-determinate 
behavior. Many of the arguments we use are virtually identical to those 
in the case of conflict of the form xa and xo. We will therefore go into 


less detail in the following proofs. 


Lemma 8.11: Suppose zab and xba are prefixes of control sequences in a 
commutative, repetition-free, weakly productive, complete schema with no 
identities. Suppose b is productive in control sequence xabz. If za and 


xb are in conflict, then the schema is not determinate. 


Proof: Let & be the memory location which is the output location of 


both a and b.. If value vy in DADEP(xab) with label & is similar to the 


value v, in DADEP(xba) with label &, there is clearly a repetition in xab. 


For the same reason, there can be no input value of an action in the dadep 


graph of a sequence with prefix xba similar to ay since a's termination 


removes the label from a value similar to vy- Analogously, no action 
after prefix ab can have an input value similar to Vo° 

As in Lemma 8.7, it can be seen that the actions corresponding to the 
indicated occurrences of c in mabyc and xbayc are similar if and only if 


there is no path from v. to the action in DADEP(wabyc} . From this, as in 


1 
Lemma 8.9, it follows that zabz and xbaz are consistent. Since b was, by 
hypothesis, productive in control sequence zabz, the same arguments used 


in Theorem 8.10 apply and a non-determinacy can be shown to exist. D 
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The last lemma shows that conflicts of the type Xa and Xb cause 
trouble when b is productive. We now show that conflicts of this type 


imply the existence of conflicts involving productive operations. 


Lemma 8.12: Let xa and xb be conflicting prefixes of a persistent, 
commutative, weakly productive schema with the prefix property and the 
immediate property. Either there exists a conflict of the form vo, ve, 
or there exist sequences vcoz and vocz in which c is productive and vc 
and vo are in conflict. 

Proof: We can assume, without loss of generality, that a initiates 
before b in prefix x, say x= x ax, br, « Since the schema is weakly 


1 

productive, there is a finite sequence wat bz in which the occurrence 
of b is productive. If a does not terminate in 2, then, by the immediate 
property , a, ax bea is the prefix of a control sequence, in violation of 
the prefix property. So we can assume both b and a terminate in 2. 

Suppose a terminates before b so a= = a2 5 bz, Let By=C1 Cy “Te. 
Using the immediate property, persistence and commutativity, we can 
"slide b to the left" as in Lemma 7.6. That is, wae bz, ac," "be," "e284 
is also a control sequence for lsi<n. From Lemma 7.7, we know that the 
sequences formed by moving b are dadep indistinguishable unless b reaches 
ac, such that & am ba, ac,c," reer and was pb acco’ ""e, 4b are in 
conflict. This will certainly happen when b reaches a. But when it 
first happens, we have a conflict of the type hypothesized in the lemma. 

So suppose b terminates before a in g so that 2=2,D2,a% 4. Since a 
overwrites the location & written by b, b could not be productive in the 


sequence unless some actor initiating in zg, has 2 as an input location. 


2 


124 


Let us write 2, as 3 where actor c has input location &. As above, 
a 


91799 


a are prefixes 


i ft s a axb c ax 5 
a Can Migrate Left so that Lax be bz ,4¢ and x, ax, ba, bz, 4 


in conflict as hypothesized. " 


Putting the last two lemmas together with Theorem 7.8 ard Theorem 8.10, 


we obtain the following major result. 


Theorem 8.13: A persistent, commutative, complete, repetition-free, 
weakly productive schema without identity operators which has the prefix 
property and the immediate property is determinate if and only if it is 


conflict-—free. 
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8.10.1 Theorem 8,13 - A Post-Mortem 

Theorem 8.13 contains a not inconsiderable number of conditions, and 
the reader may question whether any schemata of interest can satisfy all 
of them. To allay such doubts, we will review the nature of these 
conditions. 

Completeness, as we mentioned, is such a natural property that one 
would probably not bother to point out that a schema model possesses it. 
Practically any control mechanism which anticipates both outcomes of 
decisions will automatically ensure completeness. Similarly, the prefix 
property is generally satisfied by definition: finite control sequences 
are defined to be those for which no further activity is possible. Thus, 
completeness and the prefix property do not seriously restrict the class 
of systems we can model. 

Persistence and commutativity are natural properties for systems with 
multiple, independent loci of control. These properties reflect practical 
limitations on the ability of such systems to co-ordinate activity. 
Commutativity might be viewed as accepting that if two events can occur 
arbitrarily close together in time, it makes little sense to behave 
differently on the basis of order of occurrence. Similarly, persistence 
can be considered to model the inability to cancel activity on 
arbitrarily short notice. Of course, non-commutative and non-persistent 
control mechanisms can be defined, and they may exhibit certain 
acvantages. We simply suspect that practical systems will tend to be 


persistent and commutative. 


£ 
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Weak productivity, even though it is not generally a decidable 
property, appears to be a desirable one in practical terms. The increase 
of concurrency in computation should not arise solely from the introduction 
of unproductive activity. We should point out that parallelism, 
persistence, and productivity are occasionally at odds with one another. 

A simple example of this is the case where we wish to perform some action 
if either or both of two decisions are true. Parallelism encourages us to 
allow the decisions to proceed concurrently. If one decision terminates 
with true outcome before the other has even initiated, however, 
productivity would incline us to cancel the other decision, whereas 
persistence forbids cancellation. Clearly, some trade-offs between 
parallelism and productivity must be made. 

Whether Theorem 8.13 remains valid if identity operators are allowed 
is not known. Using an extended notion of repetition-freeness which 
prohibits use of an identity operator to restore the contents of a 
memory location to a value similar to one it held earlier in the 
computation (X:=X being a simple violation), we were unable to disprove 
the theorem. However, certain of the lemmas we used to prove the 


dd, 


theorem certainly cease to apply. For example, sequences a=ababcc,, E 


and y=abbacc,dd for the data flow graph of Figure 8.1 are not consistent 


F 


since the decision corresponding to Cr in « is similar to the decision 


corresponding to d, in y. Thus, the analogue to Lemma 8.9 fails. 


F 


Although we suspect the theorem remains true nevertheless, a different 


approach will be needed to prove it. 
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SCHEMA INPUTS = {X,2} 


Figure ©.1 Data flow graph with identities for which conflict can 


cause inconsistency 
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Repetition-freeness and the immediate property are less pervasive than 
some of the other properties mentioned, but they are by no means unlikely 
to be observed. The immediate property, like persistence and commutativity, 
will tend to be present when there is considerable independence among loci 
of control. Sequential schemata, fork/join formalisms, and Dennis-—Fosseen 
schemata always have the immediate property. Petri net and finite-state 
controls can easily be checked for the immediate property. 

Repetition-freeness in sequential schemata is decidable!!, and it 
appears that t-counter transttton systems as described in an appendix of 
Slutz's thesis!® can be used to determine repetition-freeness in fork/join 
controls, Dennis-Fosseen schemata, Petri net models in which the number 
of tokens in the net is bounded, and finite state models in which the 
number of "unterminated initiations" of actors is bounded. We know of no 
decision procedure for repetition-freeness of unbounded Petri net or 
state-machine controlled sche: ata. As shewn by Paterson!!, the freeness 
of an illiberal schema is rot decidable. 

Conflicts in sequential schemata and Dennis-Fosseen schemata cannot 
arise. Conflict-freeness for fork/toin, Petri net, and finite state 
controlled schemata is decidable. 

Thus, the conditions in Theorem 8.13 are not so unusual or restrictive 
as one might first suspect. The reader can verify that the various 
implementations of our sample program in Chapter 3 are all persistent, 
commutative, complete, repetition-free, weakly productive, and have the 


previ. pooperty and the immediate property. Fortunately, they are also 


conizicvct-free, hence determinate. 
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9.1.1 Summary 

Our state of knowledge about parallel processing is, at the moment, 
rather primitive. In away, this is not surprising. Individuals seem to 
be incapable of consciously working on more than one thing at a time, so 
we have less "introspective intuition" about parallel computation than we 
do for sequential processing. Only recently have computer hardware and 
software made parallel processing possible, and parallelism at the 
instruction level is still not practical. 

This dissertation has attempted to do some groundwork in parallel 
processing. ‘Recosnizing the lack of practical experience with parallel 
control mechanisms, we have defined a very flexible model for parallel 
computation. Many proposed control mechanisms can be cast in terms of 
our model, allowing meaningful comparison of apparently diverse schemes. 

Our definition of equivalence reflected a modular, input/output 
orientation. A schema is determinate if all the computations which 
might be observed for a given interpretation are equivalent. Although 
we noted that determinacy is not generally decidable, we showed that 
persistent, commutative schemata with the prefix property are determinate 
if they are conflict-free. Many control mechanisms which have been or 
might be proposed are persistent, commutative, and have the prefix 
property. 

Since not all actors alter schema output locations, we introduced 
the notion of productivity to formalize the way in which the occurrence 
of an actor can ultimately affect input/output behavior. It was seen 


that the réles of operators and deciders are quite different, supporting 
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our decision to treat them as distinct actors as opposed to the hybrid 
operations of Karp and Miller’. 

We extended the concept of productivity to schemata, requiring, in 
the weakest form, that no actor initiate unless it might be useful. 
Such weakly productive schemata satisfied some intuitive properties 
such as precluding equivalence of schemata which use different functions 
or predicates. 

Although weak productivity is not decidable in general, it can 
usually be observed in "real programs". Programmers may not have our 
formal notion of productivity in mind when they create programs, but in 
the process of program development, instructions are added "for a 
reason". Generally, at least for good programmers, this satisfies our 
requirements for productivity. 

In the last chapter, we showed how conflicts can lead to non- 
determinacy. Although conditions such as repetition-freeness are less 
natural than others like completeness, the conditions for Theorem 8.13 
are not unlikely to be met. This argues for models such as Dennis- 
Fosseen schemata in which conflicts simply cannot occur. At the very 
least, it seems we should favor models where detection of conflicts 


is uncomplicated. 
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Oe 2ie Future Directions 

We would like to see the issue of identity operators in Theorem 8.13 
put to rest. We suspect that "non-repetitious" identity operators will 
not alter the validity of the theorem. It may also be possible to relax 
repetition-freeness which seems to be an unnecessarily restrictive 
condition. 

We share with other schematologists a desire to see a better model 
for dealins with structured data such as arrays and lists. Considering 
the complications introduced by the identity operator alone, we expect 
that allowing "just cnough interpretation" to handle structured data 
without getting into problems of undecidability may be difficult indeed. 

It would be worthwhile to look into treating mathematical properties 
such as associativity and commutativity of actors. [It seems that 
commutative arguments can be modeled in a data flow graph by using the 
same index on arcs from the inputs which commute. (Of course, etmtlarity 
would have to be redefined, but the changes appear straightforward. ) 
Associativity and distributivity might also be handled, but appear to 
lead us toward the treacherous territory of rewrite rules. 

Identifying classes of schemata for which equivaleron in jecidable 
is an important goal. We would hope that such classes could be 
characterized "syntactically" to make the results relatively independent 
oi control mechanism. Schemata which are strongly productive in the 


sense of Section 8.5.2 seem to be a promising start for such a class. 


Those wno have reac 


far in this dis 
other directions for extending our worx. 


5. 


parallel systems. 


Q 
DE 


rtation will probably see 


We hope that the model and 


resented here may be of some use in continuing the study of 


10. 


il. 


12. 


13. 
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Appendix I Schema Systems 

An m-input, n-output schema looks, from our modular viewpoint, very 
much like an operator. Operators always have one output, whereas n can 
be greater than one, but this is not an important distinction. A more 
significant difference is that operators have well defined outputs for 
all interpretations while schemata may have infinite computations for 
some interpretations and therefore may not produce outputs. 

There is a natural way of allowing a schema to use other schemata 
as "macro-operators". Data flow graphs can be extended to allow m-input 
n-output actors with associated schema names. We call this new form of 
actor an application. When an application initiates in a control 
sequence, we can consider that a new aettvatton of the named schema is 
created. The schema input locations for the activation are initialized 
from the input locations of the application, and the activation can then 
proceed concurrently with the sequence which spawned it. Of course, the 
corresponding application termination event in the original sequence 
cannot occur until the activation has completed. If the activation 
does terminate and the application termination event is enabled, it can 
occur, copying the schema output values of the activation into the 
output locations of the application. In a practical system, the storage 
used by the activation could then be deallocated. 

In Figure I.1, we show a simple example of the use of a schema as 
a Macro-operator. Applications appear as hexagons to distinguish them 


from the other actors. 


Z 
CONTROL.={dd, ff ,ddpgg} 
SCHEMA INPUTS.={¥} 


SCHEMA OUTPUTS..={Z} 


CONTROL= {bhhaba, °** } 
SCHEMA INPUTS={U} 


SCHEMA OUTPUTS={W ,X} 


om 
<1) 


Main sequence bhh a 
First activation of § ¢€ qd, £ £ 
Second activation of S$ d 4 gg 


XZ Wo 225 


Fizgure I.1 Use of a schema as a macro-operator 
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The example also shows possible sequences of events from the main 
schema and the two activations of schema S. Note that b, the termination 
of application b, may have been enabled in the sequence from the main 


schema immediately after b initiated, but could not occur until the 


first activation of S completed. An extended dadep graph shows the 
effect of the sequences. Labels from the activations are subscripted 
to avoid confusion. The reader may be able to reconstruct the dadep 
graph step by step from the sequences shown and the description of the 
way application initiations and terminations behave. 

Once one understands the use of schema applications, it is easy to 
see that there is, in principle, no difficulty in allowing schemata to 
apply themselves, or to allow recursive applications of arbitrary 
complexity. In this appendix, we will formalize the notions introduced 
here and give a semi-formal proof that commutativity, persistence and 
the prefix property are sufficient to guarantee determinacy for such 


schema systems. 
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I.2.1 Definition of Terms 

An extended schema S is a schema in which ACTORS may include 
applications with associated names from a set of SCHEMA NAMES. If S. 
is an m,-input , n,-output schema name and a is an application with 
schema name S;> then the data flow graph of S contains arcs labeled 1 
through ms from (not necessarily distinct) locations in MEMORY to a, 
and arcs labeled 1 through ny from a to distinct locations in MEMORY. 
The alphabet of events, 2, of S contains an initiation event a and a 
termination event a for each application a. 

Terms such as well-sequenced, well-defined, persistent, commutative, 
control sequence, and conflitct-free are analogous to the definitions for 
ordinary schemata when applications are treated like operators. In 
particular, xa and xb are in confliet if any output location of a is 
also an input location of b, and similarly for xa and xb. 


A schema system S is a finite collection Si> So» Sk of extended 


"Sh > SCHEMA NAMES ; for lsisk. S, is the 


schemata such that {S$ Ll 


1259 
distinguished main schema. We can assume without loss of generality that 
the alphabets of events of the extended schemata are pairwise disjoint. 


Let E = E,uZ,u"""uE, be the alphabet of events of S. 


k 


The control sets of the extended schemata in a schema system 
determine what we will be defining as the system sequences of the schema 
system. As in the example, system sequences will comprise events from 
the individual schemata. Since there may be several concurrent 
activations of individual schemata, there must be additional structure 


imposed on the sequence of events to indicate which events correspond to 
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which activations. In the example, we used spacing to show this. We add 
this structure formally by defining a mapping w:N?Nu{0} which specifies, 
for each event in the system sequence, the index of an activation to 
which it belongs. That is, ~(i)=j means the io event in the system 
sequence is from the ae activation. We reserve index 0 to identify the 
initial activation of the main schema. The other indexes are important 
only insofar as they uniquely identify activations. 

If x is a sequence of events and ~ is a mapping as described above, 
we will write E(a 3) to denote the sequence formed by eliminating from 
x all those events which are not mapped into i by jp. When is clear 
from context , we may also write this as E(x,i). Since y is defined to 
identify activations of schemata, we expect E(x,i) to be the control 
sequence of some extended schema, or at least a prefix, possibly empty, 
of such a control sequence. 

We need still another formalism to recapture the information present 
in our example. Although py determines the activation in which an event 
occurs, we need to relate application initiations and terminations to 
activations. We formalize this information by another partial mapping 
S:N°N where, if the gg event in sequence x is an application initiation 
or an application termination, t(i) is the index of the activation it 
initiates or terminates. 

For our sample sequence, we can tabularly present some of these 
formalisms as follows. It should be evident that x, ~, and ¢ supply all 


the information we need to reconstruct our example. 


E(@,0) 
E(Z ,1) 


EGY ,2) 


Figure 1.2 


lon) 


0 0 ae 1 aE 0 2 1 2 0 2 
= as = i = 2 a - = Ad a 
h oh a b 
a iis. GE f 
d dp. g 


Formalisms for describing a system sequence 


13 


14 


NO 
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We formalize the notion of a system sequence as follows. (Throughout 
this appendix, we will make no distinction between a function and the 
restriction of that function to a subset of its domain. No serious 
confusion should arise.) 


Suppose x=a * is a sequence in £, and ~ and ¢ are mappings as 


1233 
introduced above. We say that x3;);0 is a system sequence for S if and 


only if the following conditions hold: 


1) wl1)=0. E(%,0) is a control sequence of the main schema, S_, or is 


1 


the prefix of a control sequence of S (Thus, the first thing to 


i 
happen must eome from the main schema.) 

2) Suppose ~(j)#0. Then there is in x an initiation ay of an application 
a with schema name Sy. say the ea initiation of a in E(z,v(i)), such 
that i<j, c(i)=v(j), and E(v,~(j)) is a control sequence of schema Sy 
or a prefix thereof. Furthermore, if there exists an n#i such that 
c(n)=0(4), then a must be the n= termination of a in E(z,¢01.))., 


E(2,v(j)) must be a finite control sequence of S,, and, if t is the 


k 
largest integer for which ¥(t)=¥(j), then t<n. (As complicated as 
this may appear, it simply states that activations are started by 


application initiations, behave like the schema applied, and must 


themselves terminate before the application termination can occur.) 
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We can extend the familiar notion of prefixes to schema systems as 
follows. Let x;~;0 and 2;y';¢' be system sequences of S. Then 1303 
is said to be a prefix of 23;p';¢' if 
aja, "a. 


2) For all i,js<n, o(i)=~(j) if and only if p'(i)=¥'G). 


1) x is a prefix of 2. Say x= 


3) For all i,js<n, if a. is an application initiaition or an application 


termination, then ¢(i)=~(j) if and only if c'Ci)=p'G). 


If conditions 1 through 3 are met and x#z, then we say that ©3930 


is a proper prefix of 23';t'. 


A system sequence x33;t is a control sequence of S if x is infinite, 
er if x is finite and there is no system sequence 23;';¢' of which «33% 
is a proper prefix. In other words, a system sequence is a control 
sequence if it runs forever or if it comes to a halt and no event can 
occur in any activation. This last restriction resembles the prefix 


property extended to schema systems. 


If x3;~;t is a system sequence of S and if x is finite, we can outline 
the definition of the dadep graph of x;~;t by the following induction. 

The dadep graph of A3;¥;2 consists of m, initial values where my is the 
number of schema input locations of the main schema, Si: If X is the gh 
schema input location of Si the rae initial value has label Xo 

In general, all memory location labels in our extended dadep graphs 
will be subscripted with the index of the activation which uses them. 


This will ensure that activations operate independently of one another. 


Only application initiations and terminations will be able to influence 
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labels from another activation and, by definition, there is no activity 
within the influenced activation when the application events occur. 


Suppose H=a,a, "a, a." and we have already defined the dadep 


graph of aa, a, 35055. 


Tf a; is the initiation of an application a with schema name Sie the 


dadep of a Ta, 555 is produced from the dadep graph of ay “Ta, 53030 


1 


by adding labels to values as follows. Suppose X is the sue input location 


of application a, and Y is the ae schema input location of schema Sue 


Then label Y_,., is added to the value bearing label X,,.,. (In a more 
o(i) pi) 

formal definition, we would use an induction hypothesis to guarantee that 

ACE exists as a label on precisely one value in the dadep graph of 


ajay "as 13035. We leave it to the reader to fill in such missing 


details using the definition of ordinary dadep graphs as a model.) 


If a. is the termination of an application a with schema name She 


then we know that Baja," "a; _,5C4)) is a finite control sequence of 


S.+ By the definition of control sequences of individual schemata, all 


output locations of schema S, are assigned values by such an activation. 


k 
We form the dadep graph of aa, 53t from the dadep graph of a, as_43 
w3o by moving and/or adding labels as follows. Suppose X is the jth 


output location of application a, and Y is the oy schema output location 


of schema Sys Then “ud) 


been a label, and X 


is removed from any value on which it may have 
., is added as a label to the value with label Y,,.,. 
pCi) eCr) 
The cases where a; is an operator or decider initiation or termination 
are cirectly analogous to the cases for ordinary dadep graphs. The only 


difference is that they deal with labels subscripted with p(i). We 


therefore omit the details for these cases. 
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An tnterpretation for a schema system S12So2 "Sy simply defines a 
single DOMAIN, functions for each function name in any schema Ss5 
predicates for each sredicate rame in any schema Sis and a set of 
initial elements for the main schema Si: 

Assignment of elements to values, consistency, similarity and other 
terms carry over to extended dadep graphs in a straightforward way. In 
particular, two control sequences 23;;¢ and 23;y';¢' are equivalent if 
both x and 2 are infinite, or if both are finite, and, for each output 
location X of the main schema, the value labeled Xo in the dadep graph of 
x2;~;0 is similar to the value labeled Xo in the dadep graph of 2;p'3c%". 

We say that a schema system S = S1 2852" Sy is persistent, 
commutative, conflict-free and has the prefix property if each extended 
schenata Ss is persistent, commutative, conflict-free and has the prefix 
property. We can show that such a system is determinate by the same 
techniques we used in the earlier proof for unextended schemata. Rather 
than repeat almost verbatim the lemmas proved earlier, we will only 
outline parts of the proof. It will hopefully be possible for interested 
readers to use the formalisms we have introduced to fill in the details 
with as much rigor as they wish. 

Suppose x23;~;¢ and 2;p';z' are I-computations of schema system S. If 
both x and 2 are infinite, the computations are equivalent, and we are 


done. We can therefore suppose that x=a “Ta, and is no longer than 


1%2 
a=b,b,""b""". As before, we will find I-computations ES ELSA 


eh equivalent to 233% such that x;=b,b, . beCi4) Ch. and 


bab,” "bs 30; 35; is a prefix of 23;yp';c'. 
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Letting Ly iW 3 Sq be £330, the basis for our inductive argument is 
established. Suppose Ls 3 bey has been found. This means that 
_ tee ote = ree ws sss 
B bib, by 43>; and Ley bib, Be yess yy Cie wi h 


Z re ‘ae. ‘ F sate S P 
wv _7) = 0 = p'(1). Additionally, for j,k<i, v._1Q) ¥,_,O) iE £ 
re. ere ee : Pia aha 
W'C3) = w'Cx) and 0,03) = b,_, Oc) 1fF 6'G) = oC). 
We consider the activation with index w'(i) in 2;'3¢' in which event 


2 wae “ee : 
b. occurs. Let v Eyrl bib, Dey , w'(i) ). We know that v is the 


prefix of some control sequence and that by can occur next in this 


activation. We will proceed as follows. We will show that there is an 


5 by 433 43534 in which prefix v has 


activation with index Jo in bib 
occurred and for which by can occur next. We will show that if by does 


not occur later in this activation, zs is the proper prefix 


Vs Spa 


of a system sequence in which by does occur in the activation. Since 


this would contradict our hypothesis that Ls is a control 


Patan en 
sequence, we will conclude that b, eventually occurs in the activation 


with index jo: We will then argue that b,'s occurrence can take place 


immediately after prefix b,b,°""b without changing the 


as eee eee 


dadep graph of vs This will provide us with I~computation 


Dare Wega sear 


50.5555 completing the induction. 


146 


We begin by finding the activation of Zs _13¥ with index Jo 


§-2>54-1 
which is "analogous" to the activation of 23;';¢' with index yp'(i). If 


w'(i) = 0, let jp = 9. & C-Bebe Ds » 0 ) = v because p'(1) = 0 


Yaa 
= t(2\ — we * s\ - tre 
and ,_,(1) = 0 and y'(j) = p'(1) iff ¥, 5G) = ¥,_,Q2). If w'G) #0, 
then there is some k < i such that c'(k) = p'(i). Again, by our inductive 
hypotheses, p'(j) = C'(k) iff ¥; 5) - C._,k). In this case, let j, be 
(k). In either case, we see that Hp bb, Diy = )=v. 


). That is, w is the rest of the sequence from 


Pra 


Let vw = E ¢ 


ie ae a 
V4 i-l 0 


the activation of which v was the prefix. 

We now argue that event bs must occur in w, for suppose not. Since 
vb, = Ey bib," “b, ¢ WC Ds b, is enabled after prefix v. By the 
persistence of the individual schemata, vw ; would then be the prefix of 
a control sequence. Suppose bs is the termination of an application. 
(This is the most complicated case. Other types of events can be treated 
in a similar manner.) Let ky be the index in 2 of the corresponding 


application initiation event. That is, k, is the unique index for which 


9] 


6' (ky) = p'(i). Since ko < i, we know by our induction hypotheses that 
5 M t] = (2 2 a ° ° 
for all j<i,t (ky) = p'(j) iff Gs _4 Ck) = ¥,_,Q). Since b; was able 


to occur when it did in 23p'3z', EyrC bybo "bs 5 o'(ky) ) must have 


been a finite control sequence. But E ( b,b,°*"b 


bis, tate Cee) .) 
Ys 4 12 i-l i-1" 0 
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is the same finite control sequence, so b. is free to occur in the 
a 


activation with index i, after prefix b.b,’‘"b - Extend 


Jo 12 Ge Vane oG cs 


Ve and Ce so that v,_, (n+l) = Jj, and C;_, (ntl) 7 +_46ky)> and 


0 


consider x. _b.3y 
i i 


é be ae ee ee ae 
1. If this is a system sequence, Ze 5¥s_ 735s 4 


ape 


is clearly a proper prefix of it, violating the hypothesis that 


x is a control sequence. We have shown that vwb is 


et ago oat 


Ey ( ae af ), the prefix of a control sequence. We know that 
i-1 hs. 


the activation indexed by ,_,(nt1) has already terminated, so this cannot 
prevent by from occurring. The only thing which could prevent 


x from satisfying the definition of a system sequence 


4-24-1375 03 
is that some application termination event in ey terminate the 


activation indexed by i before by can occur. But this would imply 
that some proper prefix of vwb . is a control sequence, contradicting the 
hypothesized prefix property of the individual schemata. Thus, we can 
conclude that event bs occurs in w. 

We have succeeded in showing that w = WCW where CF bee The 


persistence and commutativity of the individual schemata allow us to 


conclude that VC. W WV, is a control sequence or prefix thereof. 


L48 


tere Py Pier, ied, | Sh 

Let ¥,G) = ¥; 40) for j < i and j >k 
= W,_,(k) for j =i 
= ¥;_,G-1) for i<j<k 

Let 0,03) = 0,_,¢)) for j < i and j >k 
= 0,_,(k) for j = i 
= 63_,05-1) | for i<j<k 


In short, Zs5Ws 505 is the result of promoting the occurrence of event 
cy in activation jg of Cs 130333534" By arguments similar to those 
used in showing that b; occurred in wv, Ts 5Ws 3s is a control sequence. 
The sequences observed in all activations except that activation indexed 


by jo remain the same as in Te 3¥5 Ss 4° Ey Cx, i, J) = WW CW, 
i-l1 : 


while Ey Zs Jo ) = DEW jWo- Since the individual schemata are 


conflict-free, it is not too difficult to see that DW Wo and Vo, WW 
do exactly the same thing to the initial values of activation Jo: (A 
formal proof would follow the lines of the dadep indistinguishability 
results for unextended schemata.) Therefore, the dadep graph of 255i 305 


is virtually identical to the dadep graph of x. This means 


4-22¥4-7>54-7° 


©5353 555 is an I-computation equivalent to Zs _ 45037583 4° completing the 


induction. 
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We can now conclude the proof as in the case of ordinary schemata. 
That is, x =b,b,"""bL. If aS BEL is a proper prefix of 2;w's;c', we 
contradict the assumption that ah el is a control sequence. If 
rn and aS is a prefix of 2;p';¢', it follows that the 
sequences are equivalent. (They need not be identical. That is, v4) 
need not equal p'(i). However, the restrictions that 0) = 09) iff 
w'(i) = w'(j) and 64) = an) iff c'(i) = w'(4j) make any differences 
unimportant. That is, the sequences differ only on the "names" of the 
seeieniona) 

Since “3;3;¢ is equivalent to x 3¥ 35, which is, in turn, equivalent 
to 23;¥';¢', the original I-computations are equivalent, and the schema 


is determinate. This concludes the outline of the proof. 
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